Skip to main content

View Post [edit]

Poster: yogiks Date: Feb 23, 2016 12:46pm
Forum: opensource Subject: PDFs are uploaded as Image Container PDF instead of Text PDF

The "Image Container PDF" is used as the default file format when I upload any pdf file using *ia upload* command line utility. But I would like this to be uploaded as "Text PDF". How do I do this when I upload any pdf file? I don't want to change file format for every file in the metadata editor after uploading it.


thanks
yogi

Reply [edit]

Poster: Jeff Kaplan Date: Feb 23, 2016 9:35pm
Forum: opensource Subject: Re: PDFs are uploaded as Image Container PDF instead of Text PDF

currently that is the only way to change it. are you certain they are text and not image pdfs? have they not derived successfully?

Reply [edit]

Poster: yogiks Date: Feb 29, 2016 12:54am
Forum: opensource Subject: Re: PDFs are uploaded as Image Container PDF instead of Text PDF

Yes, they are texts in Kannada langauge but are low-quality scans. :(
It would have been great if they were derived to *filename.djvu* but are currently derived to *filename.gif*, *filename_djvu.xml*, etc.,.

Is there a way to specify anything to make a file derive to djvu format (*filename.djvu*) while uploading the pdf files?


thanks
yogi

Reply [edit]

Poster: Jeff Kaplan Date: Feb 29, 2016 9:49am
Forum: opensource Subject: Re: PDFs are uploaded as Image Container PDF instead of Text PDF

we are in the process of stopping deriving .djvu files as there is so little request or demand for them.

Reply [edit]

Poster: yogiks Date: Mar 3, 2016 11:02pm
Forum: opensource Subject: Re: PDFs are uploaded as Image Container PDF instead of Text PDF

Okay. But users can still upload .djvu files along with pdf right? Will the uploaded djvu files will also be removed?

thank you

Reply [edit]

Poster: Jeff Kaplan Date: Mar 4, 2016 8:18am
Forum: opensource Subject: Re: PDFs are uploaded as Image Container PDF instead of Text PDF

we never remove source files so yes, you can upload .djvu.