What photo formats are supported?

26 RAW formats (ARW, CR2, CR3, NEF, DNG, ORF, RW2, RAF, and more), plus HEIC, HEIF, JPG, PNG, TIFF, AVIF, WebP, and JPEG XL.

How much does BurstPick cost?

Free during the beta. After launch, Pro will be available as a one-time in-app purchase on the Mac App Store. No subscription, no recurring fees.

What are the system requirements?

Mac: macOS 14 (Sonoma) or later, Apple Silicon or Intel. Windows: Windows 10/11 (64-bit). Mac version launches first; Windows is coming soon.

Does it require an internet connection?

No. All ML inference runs on your device. No photos are sent anywhere.

How many ML models does it use?

20+ models covering image quality, aesthetics, similarity embeddings, face recognition, VLM assessment, and image classification. All swappable in settings.

Will it delete my photos?

No. BurstPick is non-destructive. Export creates hard links to save disk space; your originals stay untouched.

Does it work with Lightroom?

Yes. Export your Keep/Reject decisions and star ratings as XMP sidecar files, or export directly to a Lightroom Classic catalog.

A team of photographers and macOS engineers who shoot 10,000+ frames per session. We built it because we needed it. Existing tools didn't get burst workflows.

How does BurstPick fit into my workflow?

It's a companion, not a replacement. Ingest with Photo Mechanic, cull with BurstPick, edit in Lightroom. Full XMP compatibility, no extra steps.

Is BurstPick available yet?

Not yet. Sign up for updates and we'll let you know when it launches.

ML Models

BurstPick ships with 22 ML models across 6 categories. Every model runs on-device using Apple CoreML and Neural Engine. No photos leave your Mac. Models are either bundled with the app or downloaded when you first select them.

100% On-DeviceOpen-Source ModelsNeural Engine Accelerated

Image Quality Assessment

Scores each photo on sharpness, noise, exposure, and perceptual clarity. Used to rank photos within burst clusters.

Heuristic (Laplacian + Luma)

Bundled

Built-in

Instant scoring using Accelerate vDSP/vImage. Measures sharpness, exposure, noise, and eye closure. Cannot judge composition or semantic quality. Best for fast initial culling passes.

Parameters: 0Input: 4096pxDownload: NoneMemory: 0 MB

TOPIQ NR

Apache 2.0

No-reference IQA using ResNet50 backbone. Good general-purpose technical quality scores. Good balanced option for quality assessment.

Parameters: 45.2MInput: 224pxDownload: 83 MBMemory: 100 MB

MUSIQ (KonIQ)

Apache 2.0

Multi-scale transformer trained on KonIQ-10k real-world distortions. Strong on natural photos with better perceptual alignment than TOPIQ.

Parameters: 27MInput: 224pxDownload: 50 MBMemory: 110 MB

MANIQA

Apache 2.0

NTIRE 2022 IQA Challenge winner. Multi-dimension attention captures fine perceptual differences. Most accurate but largest in category.

Parameters: 135.7MInput: 224pxDownload: 248 MBMemory: 140 MB

NIMA (MobileNet)

Apache 2.0

Neural Image Assessment trained on AVA (250K aesthetic ratings). Outputs 10-class probability distribution. Compact MobileNet backbone — fastest aesthetic quality model.

Parameters: 4.2MInput: 224pxDownload: 14 MBMemory: 20 MB

Aesthetic Scoring

Rates artistic appeal based on composition, color harmony, and visual balance. Trained on large-scale human preference data.

LAION Aesthetic v1

Bundled

MIT

Lightweight linear probe on CLIP embeddings — near-zero overhead if CLIP is loaded. Trained on LAION aesthetic ratings. Good default aesthetic scorer.

Parameters: 513Input: CLIPDownload: BundledMemory: 340 MB

ViT-B/16 Aesthetic

Apache 2.0

Standalone ViT-B/16 fine-tuned on AVA dataset (250K human aesthetic ratings). More nuanced aesthetic judgment than LAION probe. Independent of CLIP.

Parameters: 86MInput: 224pxDownload: 156 MBMemory: 350 MB

Image Embedding

Turns photos into vectors for similarity clustering. Groups burst sequences and flags duplicates or near-duplicates.

Apple Vision FeaturePrint

Bundled

Apple

Built into macOS — zero download, instant availability. Good general-purpose scene similarity. Best for speed-first workflows.

Parameters: SystemInput: 1024pxDownload: NoneMemory: System

DINOv2 ViT-S/14

Apache 2.0

State-of-the-art self-supervised features (Meta, LVD-142M). Excellent visual similarity and scene structure. Recommended balanced choice.

Parameters: 22.1MInput: 224pxDownload: 40 MBMemory: 88 MB

CLIP ViT-B/32

MIT / Apache 2.0

Rich semantic understanding from multimodal training. Groups photos by content meaning. Required by LAION Aesthetic scorer. Best for diverse libraries.

Parameters: 86MInput: 224pxDownload: 161 MBMemory: 340 MB

Face Embedding

Builds face identity vectors for person grouping. Clusters photos by who appears in them.

EdgeFace-XS

Apache 2.0

Fastest option — lightweight 4 MB download. Good face grouping for most photos (LFW 99.73%). Best when speed is the priority.

Parameters: 1.77MInput: 112pxDownload: 4 MBMemory: 8 MB

EdgeFace-S

Apache 2.0

Good balance of speed and accuracy (LFW 99.82%, IJB-B 94.38%). Small download. Handles varied lighting well. Recommended balanced choice.

Parameters: 3.65MInput: 112pxDownload: 7 MBMemory: 16 MB

AdaFace IR-18

Bundled

MIT

Strong on low-quality and challenging face crops via adaptive margin (CVPR 2022). LFW 99.82%. Good mid-tier choice.

Parameters: 24MInput: 112pxDownload: BundledMemory: 48 MB

AdaFace IR-50

MIT

Top-tier accuracy (LFW 99.82%, IJB-B 95.67%). Excels on difficult poses and low-quality crops. Best when face grouping precision is critical.

Parameters: 44MInput: 112pxDownload: 80 MBMemory: 90 MB

AuraFace v1

Apache 2.0

Large ResNet-100 backbone with permissive Apache 2.0 license. Choose mainly for licensing requirements.

Parameters: 65MInput: 112pxDownload: 119 MBMemory: 125 MB

GhostFaceNets

Apache 2.0

SOTA 2025 lightweight face recognition model. High performance with minimal computational overhead.

Parameters: ~2MInput: 112pxDownload: 2 MBMemory: 10 MB

Vision Language Model (VLM)

Reads photo content using natural language. Gives scene descriptions and quality reasoning that go beyond numerical scores.

Heuristic Estimate

Bundled

Built-in

Built-in fallback using heuristic image analysis (sharpness, exposure, noise, faces). No download required. Replace with a real VLM for improved results.

Parameters: 0Input: 512pxDownload: NoneMemory: 0 MB

SmolVLM2 256M

Apache 2.0

Smallest VLM — fastest inference with minimal memory. Basic scene recognition and quality commentary. Best for quick screening on constrained hardware.

Parameters: 256MInput: 384pxDownload: 465 MBMemory: 300 MB

SmolVLM2 2.2B

Apache 2.0

Full-size SmolVLM with strong scene understanding and quality reasoning. More capable but slower than 256M variant.

Parameters: 2.2BInput: 384pxDownload: 3.9 GBMemory: 2.2 GB

FastVLM 0.5B

Apache 2.0

Apple FastVLM with FastViTHD hybrid encoder. Optimized for on-device speed with solid scene recognition. Recommended balanced VLM choice.

Parameters: 0.5BInput: 512pxDownload: 1.4 GBMemory: 600 MB

FastVLM 1.5B

Apache 2.0

Largest and most capable VLM. Deep scene understanding, nuanced quality reasoning, and detailed descriptions. Best when VLM quality is the top priority.

Parameters: 1.5BInput: 768pxDownload: 3.5 GBMemory: 1.5 GB

Image Classification

Tags photos with scene and object labels for filtering and organization. Uses Apple's Vision framework.

Apple Vision Classification

Bundled

Apple

Built-in macOS image classification using VNClassifyImageRequest. Fast, no download required. Provides scene and object tags for filtering.

Parameters: SystemInput: AutoDownload: NoneMemory: System