Recommended models
Three tiers based on your available RAM and quality needs.
Vision-capable at just 4B parameters. Handles printed text, receipts, and simple documents well.
Excellent vision understanding. Handles complex layouts, tables, and handwritten text.
Maximum accuracy for complex multi-page documents. Needs significant RAM.