New ask Hacker News story: Ask HN: What are you using to parse PDFs for RAG?

Ask HN: What are you using to parse PDFs for RAG?
2 by carlbren | 1 comments on Hacker News.
Hi, I'm looking for a simple way to convert PDFs into markdown with integrated images and tables. Tried Llamaindex, but no integrated images. Tried Langchain, but some PDFs will have the footer being parsed before the top. Tried to use Adobe PDF API, but have to pay $25K upfront!

Comments