Skip to main content


have anyone here tried the Surya Text recognition? Surya is a multilingual document #OCR toolkit which can perform Accurate line-level text detection and recognition (OCR) in any language
VikParuchuri also have done marker, a tool to Convert PDF to markdown quickly with high accuracy
https://mastodon.social/@pythonhub/111563617706321152
link to surya
https://github.com/VikParuchuri/surya
cc @chikim