🧀 BigCheese.ai

Social

Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o

🧀

An open-source OCR API project on GitHub that leverages OpenAI's powerful language models with techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents.

  • Uses OpenAI's GPT-4 Turbo with Vision
  • Features parallel PDF conversion
  • Allows batch processing
  • Markdown formatting for output
  • API endpoint provided for usage