Skip to content
View afoltzm's full-sized avatar
🏗️
🏗️

Organizations

@brooklyndefenders

Block or report afoltzm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

💬 NLP

10 repositories

Links to awesome OCR projects

2,760 345 Updated Jul 6, 2024

A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!

Python 271 34 Updated Feb 4, 2024

PAGE XML format collection for document image page content and more

XSLT 63 8 Updated Jul 7, 2021

Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format

C++ 44 7 Updated Apr 16, 2024

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 13,756 1,002 Updated Sep 15, 2024

Official implementation of Character Region Awareness for Text Detection (CRAFT)

Python 3,075 872 Updated Jul 16, 2024

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 23,909 3,122 Updated Sep 24, 2024

A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.

Python 1,389 358 Updated Aug 1, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 29,755 4,363 Updated Oct 1, 2024

🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…

Python 16,970 1,853 Updated Oct 1, 2024