💬 NLP
A free tool that OCRs a PDF and adds a text "layer" to the original file, making the PDF searchable. Uses only open source tools. Please tip!
PAGE XML format collection for document image page content and more
Tool that performs layout analysis and/or text recognition using Tesseract and outputs the result in PAGE XML format
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, and more.
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
💫 Industrial-strength Natural Language Processing (NLP) in Python
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…