Replies: 1 comment
-
|
new better version ... in all cases
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
In my opinion, the following script covers 90% of all cases when someone wants to convert a PDF to TXT/Markdown.
maybe some ideas to add to your internal commands (but for shure i missed somewaht) ^^
Since I am not a programmer, it is certainly not very clearly structured and a few things are missing and maybe it give more common base ways (e.g., internal commands; checking whether it is a valid or invalid PDF; first extracting images to generate image descriptions via LLM... etc.).
what i have:
The code isn't particularly elegant, but it works, and I wanted to make that clear and ask what I could simplify, although I need some information and formats to do what I'm doing ;)
start_with tables_img02_drawing06_hypen01_layout01_textinsertion02_max03.py
Beta Was this translation helpful? Give feedback.
All reactions