@axsaucedo
Created December 13, 2025 14:43
Convert Multiple PDFs to Markdown on Mac

Option 1

When the PDF already contains extractable text (no OCR needed), Microsoft's MarkItDown is the fastest option.

Install:

pip install 'markitdown[pdf]'

Run:

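# -print0 with read -d '' keeps filenames that contain spaces intact
find ./folder -type f -name '*.pdf' -print0 | while IFS= read -r -d '' i; do
  mkdir -p "output/$(dirname "$i")"   # make sure the mirrored output folder exists
  markitdown "$i" -o "output/${i%.pdf}.md"
done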
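
For repeated runs over a large folder, the loop above can be wrapped so that files already converted are skipped. This is only a sketch: `md_path_for` and `convert_all` are hypothetical helper names (not part of markitdown), and it assumes the `markitdown FILE -o OUTPUT` form used above.

```shell
# Hypothetical helper: map a PDF path to its Markdown path under an output root.
md_path_for() {
  local pdf="$1" out_root="${2:-output}"
  local rel="${pdf#./}"              # drop a leading "./" if present
  printf '%s/%s.md\n' "$out_root" "${rel%.pdf}"
}

# Hypothetical helper: convert every PDF under a folder, skipping existing output.
convert_all() {
  local src="${1:-./folder}" out_root="${2:-output}" pdf md
  find "$src" -type f -name '*.pdf' -print0 |
    while IFS= read -r -d '' pdf; do
      md="$(md_path_for "$pdf" "$out_root")"
      [ -s "$md" ] && continue       # non-empty output already there, skip it
      mkdir -p "$(dirname "$md")"
      markitdown "$pdf" -o "$md"
    done
}
```

Usage: `convert_all ./folder output`. Delete an output file to force that PDF to be converted again.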

Option 2

When the PDFs are more complex, use marker.

Install (on Mac, pin v1.8.0, because a recent bug makes newer versions 20x slower):

pip install marker-pdf==1.8.0

Run (adjust the flags to your requirements):

marker --disable_image_extraction --output_dir ./output/ --pdftext_workers 2 --disable_ocr ./
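
To decide per file which option fits, one rough check is whether the PDF yields any text at all. The sketch below assumes `pdftotext` from Poppler is installed (e.g. via Homebrew), which is not part of the original setup; `has_text_layer` is a hypothetical helper name and the 50-character threshold is an arbitrary assumption.

```shell
# Hypothetical helper: succeed if the first pages of a PDF contain extractable text.
# Assumes pdftotext (Poppler) is on PATH; the threshold of 50 characters is arbitrary.
has_text_layer() {
  local pdf="$1" chars
  # Extract up to page 3 to stdout, strip whitespace, count what is left.
  chars="$(pdftotext -l 3 "$pdf" - 2>/dev/null | tr -d '[:space:]' | wc -c)"
  [ "$((chars))" -gt 50 ]
}
```

Usage: `has_text_layer file.pdf && echo "Option 1" || echo "Option 2"`.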