
As someone who spent quite a bit of time with table-transformers, I would definitely not recommend it. It was one of the first libraries we added to our chunking library [1] for parsing tables, and the results were very underwhelming. This was a while back, and at this point it's just so much easier to use an LLM end to end for parsing docs (Gemini Flash can parse 20k pages per dollar). I'm also wary of any approach that stitches together different models.

[1] https://github.com/Filimoa/open-parse/



Do you have any benchmark results I can look at that compare the two approaches?



