DOGE employee (image post, lemmy.world)
cm0002@lemmy.world to Programmer Humor@programming.dev · 3 months ago · 96 comments
lime!@feddit.nu · 3 months ago (edited)
$ pandoc doc.pdf -o doc.txt
Edit: welp, pandoc can’t do that. pdftotext it is.
mexicancartel@lemmy.dbzer0.com · 3 months ago (edited)
magick file.jpg file.html
ImageMagick be converting anything into anything. (Actually, in this case it makes an HTML file and a PNG file; the HTML file references the PNG, and the HTML page displays it.)
lime!@feddit.nu · 3 months ago
Not really a good way to get the text out of a PDF though. Then again, turns out neither is pandoc.
stetech@lemmy.world · 3 months ago
I thought pandoc didn’t support converting from PDF, only to?!
lime!@feddit.nu · 3 months ago
Damn it, you’re right. Should probably have checked that…