Skip to content
BurstPickBurstPick

ML Models

BurstPick ships with 22 ML models across 6 categories. Every model runs on-device using Apple CoreML and Neural Engine. No photos leave your Mac. Models are either bundled with the app or downloaded when you first select them.

100% On-DeviceOpen-Source ModelsNeural Engine Accelerated

Image Quality Assessment

Scores each photo on sharpness, noise, exposure, and perceptual clarity. Used to rank photos within burst clusters.

Heuristic (Laplacian + Luma)

Bundled
Built-in

Instant scoring using Accelerate vDSP/vImage. Measures sharpness, exposure, noise, and eye closure. Cannot judge composition or semantic quality. Best for fast initial culling passes.

Parameters: 0Input: 4096pxDownload: NoneMemory: 0 MB

TOPIQ NR

Apache 2.0

No-reference IQA using ResNet50 backbone. Good general-purpose technical quality scores. Good balanced option for quality assessment.

Parameters: 45.2MInput: 224pxDownload: 83 MBMemory: 100 MB

MUSIQ (KonIQ)

Apache 2.0

Multi-scale transformer trained on KonIQ-10k real-world distortions. Strong on natural photos with better perceptual alignment than TOPIQ.

Parameters: 27MInput: 224pxDownload: 50 MBMemory: 110 MB

MANIQA

Apache 2.0

NTIRE 2022 IQA Challenge winner. Multi-dimension attention captures fine perceptual differences. Most accurate but largest in category.

Parameters: 135.7MInput: 224pxDownload: 248 MBMemory: 140 MB

NIMA (MobileNet)

Apache 2.0

Neural Image Assessment trained on AVA (250K aesthetic ratings). Outputs 10-class probability distribution. Compact MobileNet backbone — fastest aesthetic quality model.

Parameters: 4.2MInput: 224pxDownload: 14 MBMemory: 20 MB

Aesthetic Scoring

Rates artistic appeal based on composition, color harmony, and visual balance. Trained on large-scale human preference data.

LAION Aesthetic v1

Bundled
MIT

Lightweight linear probe on CLIP embeddings — near-zero overhead if CLIP is loaded. Trained on LAION aesthetic ratings. Good default aesthetic scorer.

Parameters: 513Input: CLIPDownload: BundledMemory: 340 MB

ViT-B/16 Aesthetic

Apache 2.0

Standalone ViT-B/16 fine-tuned on AVA dataset (250K human aesthetic ratings). More nuanced aesthetic judgment than LAION probe. Independent of CLIP.

Parameters: 86MInput: 224pxDownload: 156 MBMemory: 350 MB

Image Embedding

Turns photos into vectors for similarity clustering. Groups burst sequences and flags duplicates or near-duplicates.

Apple Vision FeaturePrint

Bundled
Apple

Built into macOS — zero download, instant availability. Good general-purpose scene similarity. Best for speed-first workflows.

Parameters: SystemInput: 1024pxDownload: NoneMemory: System

DINOv2 ViT-S/14

Apache 2.0

State-of-the-art self-supervised features (Meta, LVD-142M). Excellent visual similarity and scene structure. Recommended balanced choice.

Parameters: 22.1MInput: 224pxDownload: 40 MBMemory: 88 MB

CLIP ViT-B/32

MIT / Apache 2.0

Rich semantic understanding from multimodal training. Groups photos by content meaning. Required by LAION Aesthetic scorer. Best for diverse libraries.

Parameters: 86MInput: 224pxDownload: 161 MBMemory: 340 MB

Face Embedding

Builds face identity vectors for person grouping. Clusters photos by who appears in them.

EdgeFace-XS

Apache 2.0

Fastest option — lightweight 4 MB download. Good face grouping for most photos (LFW 99.73%). Best when speed is the priority.

Parameters: 1.77MInput: 112pxDownload: 4 MBMemory: 8 MB

EdgeFace-S

Apache 2.0

Good balance of speed and accuracy (LFW 99.82%, IJB-B 94.38%). Small download. Handles varied lighting well. Recommended balanced choice.

Parameters: 3.65MInput: 112pxDownload: 7 MBMemory: 16 MB

AdaFace IR-18

Bundled
MIT

Strong on low-quality and challenging face crops via adaptive margin (CVPR 2022). LFW 99.82%. Good mid-tier choice.

Parameters: 24MInput: 112pxDownload: BundledMemory: 48 MB

AdaFace IR-50

MIT

Top-tier accuracy (LFW 99.82%, IJB-B 95.67%). Excels on difficult poses and low-quality crops. Best when face grouping precision is critical.

Parameters: 44MInput: 112pxDownload: 80 MBMemory: 90 MB

AuraFace v1

Apache 2.0

Large ResNet-100 backbone with permissive Apache 2.0 license. Choose mainly for licensing requirements.

Parameters: 65MInput: 112pxDownload: 119 MBMemory: 125 MB

GhostFaceNets

Apache 2.0

SOTA 2025 lightweight face recognition model. High performance with minimal computational overhead.

Parameters: ~2MInput: 112pxDownload: 2 MBMemory: 10 MB

Vision Language Model (VLM)

Reads photo content using natural language. Gives scene descriptions and quality reasoning that go beyond numerical scores.

Heuristic Estimate

Bundled
Built-in

Built-in fallback using heuristic image analysis (sharpness, exposure, noise, faces). No download required. Replace with a real VLM for improved results.

Parameters: 0Input: 512pxDownload: NoneMemory: 0 MB

SmolVLM2 256M

Apache 2.0

Smallest VLM — fastest inference with minimal memory. Basic scene recognition and quality commentary. Best for quick screening on constrained hardware.

Parameters: 256MInput: 384pxDownload: 465 MBMemory: 300 MB

SmolVLM2 2.2B

Apache 2.0

Full-size SmolVLM with strong scene understanding and quality reasoning. More capable but slower than 256M variant.

Parameters: 2.2BInput: 384pxDownload: 3.9 GBMemory: 2.2 GB

FastVLM 0.5B

Apache 2.0

Apple FastVLM with FastViTHD hybrid encoder. Optimized for on-device speed with solid scene recognition. Recommended balanced VLM choice.

Parameters: 0.5BInput: 512pxDownload: 1.4 GBMemory: 600 MB

FastVLM 1.5B

Apache 2.0

Largest and most capable VLM. Deep scene understanding, nuanced quality reasoning, and detailed descriptions. Best when VLM quality is the top priority.

Parameters: 1.5BInput: 768pxDownload: 3.5 GBMemory: 1.5 GB

Image Classification

Tags photos with scene and object labels for filtering and organization. Uses Apple's Vision framework.

Apple Vision Classification

Bundled
Apple

Built-in macOS image classification using VNClassifyImageRequest. Fast, no download required. Provides scene and object tags for filtering.

Parameters: SystemInput: AutoDownload: NoneMemory: System

All models run as CoreML on your Mac using CPU, GPU, and Neural Engine. Download sizes are for the compressed CoreML package. Memory figures are approximate GPU/ANE usage during inference.