Convert a batch of PDF files to text files

I have thousands of PDF files that I need to convert to plain text for processing.

I can convert them individually using the Save As... dialog. However, I would like to set up an Action to automate the process, but I see no action in the Action window that would allow me to specify the required output format.

Am I missing something, or is it not possible to convert a batch of PDF files to plain text files in Adobe Acrobat X?

Thank you

Thanks for the tip, Bernd.

I don't really know JavaScript, but I managed to make it work by referring to the following page and using my rudimentary knowledge of PHP:

http://acrobatusers.com/tutorials/how-save-PDF-Acrobat-JavaScript

I created a new Action with the Action Wizard.

I set "Start with:" to my files folder (for example, C:\MyFiles).

I added an "Execute JavaScript" step and used the following code:

/* PDF to text */

this.saveAs("/c/MyFiles/Output/" + this.documentFileName + ".txt", "com.adobe.acrobat.plain-text");

Note: The value "/c/MyFiles/Output/" is where you want the files to be output ("/c/MyFiles/Output/" corresponds to C:\MyFiles\Output).
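For anyone unsure how a Windows folder maps to the path style Acrobat JavaScript expects, here is a minimal sketch. The function name toDevicePath is my own (hypothetical, not part of the Acrobat API), and it assumes the input always starts with a drive letter:

```javascript
// Hypothetical helper, not part of the Acrobat API: converts a Windows path
// such as C:\MyFiles\Output into the slash-separated form used with saveAs.
function toDevicePath(windowsPath) {
    // Capture the drive letter and everything after the optional first separator.
    var match = /^([A-Za-z]):[\\\/]?(.*)$/.exec(windowsPath);
    if (!match) {
        throw new Error("Expected a path starting with a drive letter: " + windowsPath);
    }
    var drive = match[1].toLowerCase();
    // Turn backslashes into forward slashes and drop any trailing slash.
    var rest = match[2].replace(/\\/g, "/").replace(/\/+$/, "");
    // Always end with a slash so a file name can be appended directly.
    return "/" + drive + (rest ? "/" + rest : "") + "/";
}

// toDevicePath("C:\\MyFiles\\Output") returns "/c/MyFiles/Output/"
```

The returned value ends with a slash, so it can be concatenated with the file name in the same way as the hard-coded folder in the script.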

I set "Save to:" to "Don't Save Changes" (since the JavaScript code does the saving itself).

I then ran the Action, and files such as C:\MyFiles\101.pdf were output as text files to C:\MyFiles\Output\101.pdf.txt, which is good enough for me.
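If the doubled extension (101.pdf.txt) is a nuisance, a small variation could strip ".pdf" before adding ".txt". This is only a sketch: textFileName is a hypothetical helper of my own, and the output folder is assumed to be C:\MyFiles\Output, written as /c/MyFiles/Output/.

```javascript
// Hypothetical helper: turns "101.pdf" into "101.txt".
// Names without a .pdf extension simply get ".txt" appended.
function textFileName(pdfFileName) {
    return pdfFileName.replace(/\.pdf$/i, "") + ".txt";
}

// Inside an Acrobat Action, `this` is the document being processed.
// The guard keeps the snippet harmless when it is run outside Acrobat.
if (typeof app !== "undefined") {
    this.saveAs("/c/MyFiles/Output/" + textFileName(this.documentFileName),
                "com.adobe.acrobat.plain-text");
}
```

With this version, C:\MyFiles\101.pdf would be written as 101.txt in the output folder instead of 101.pdf.txt.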

I do seem to have to run it in batches of about 350 files, however, or else Acrobat stumbles and dies.
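One way to plan those runs is to split the list of file names into groups before pointing the Action at each group. The sketch below is plain JavaScript and does not touch Acrobat at all. The helper name splitIntoBatches is hypothetical, and the size of 350 is just the rough limit I observed, not a documented one.

```javascript
// Hypothetical helper: splits a list of file names into consecutive groups
// of at most batchSize entries each.
function splitIntoBatches(fileNames, batchSize) {
    if (!(batchSize >= 1)) {
        throw new Error("batchSize must be at least 1");
    }
    var batches = [];
    for (var i = 0; i < fileNames.length; i += batchSize) {
        // slice() stops at the end of the array, so the last batch may be shorter.
        batches.push(fileNames.slice(i, i + batchSize));
    }
    return batches;
}

// Example: 800 files split into groups of 350, 350 and 100.
var names = [];
for (var n = 1; n <= 800; n++) {
    names.push(n + ".pdf");
}
var batches = splitIntoBatches(names, 350);
```

Each group could then be moved into its own folder and processed as a separate run of the Action.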

Hope that helps someone else in the future (i.e. my future self).

Gruff

Tags: Acrobat

Similar Questions

  • Convert a batch of PDF files to TIFF

    I'm running Acrobat X Standard and can't see where to batch convert multiple PDF files to TIFF format, so that our estimating software can open them.  I know it's a piece of cake to do in Acrobat 9 Standard, but I must be overlooking something here.  For what it's worth, all the source files are located in the same folder.

    I know this is an old post, but here is how it can be done with Acrobat Standard.

    It can be done with Standard, but not in a single step as in Pro.

    Open Acrobat and, under Getting Started, choose "Combine Files into PDF".

    Drop all your PDF files in the box and click on "Combine Files".

    Once the file is created, go to File > Save As > Image > TIFF.

    Each page will be saved as a separate TIFF file wherever you choose to save.

    The only problem is that the name of each page will be the same except for an incrementing number.

  • How to convert a batch of PDF files to TIFF

    I need to convert a batch of 2000 multipage PDF files into multipage TIFFs with the same names as the original PDF files.

    It cannot be done with Reader, but it may be possible in Acrobat.

    Try asking in the Acrobat forum.

    http://forums.Adobe.com/community/Acrobat/creating__editing_%26_exporting_pdfs

  • Batch optimizing PDF files using the custom preset?

    Does anyone know if it is possible to batch optimize PDF files using a preset I created in the advanced options of the 'Optimize PDF' window? Is there a way to script this? I can't find a way to do it. The standard "Reduce File Size" settings are not suitable for what I'm trying to do. Thank you!

    Yes, you can do it. In the dialog box for editing your action, add a Save command and then click the button below it to specify the settings. Check the PDF Optimizer box and click Settings. Select your custom optimization profile and you're ready to go!

  • If my Word document is 106,480 KB, how can I convert it into a PDF file if it exceeds the 100 MB limit?

    If my Word document is 106,480 KB, how can I convert it into a PDF file if it exceeds the 100 MB limit?

    Get Acrobat and you won't have these limitations.

  • Can you confirm whether I can convert password-protected PDF files?

    Before I sign up for the service, can you confirm that the PDF to Excel conversion works when the files are password-protected?

    Hi Steve,

    If you want to use the PDF Pack service for conversion, you cannot convert a password-protected PDF file. That is possible with Acrobat Pro DC.

    Thank you

    Abhishek

  • Acrobat Pro 11.0.10 does not allow me to convert or batch convert MS Word files to PDF

    I installed the latest version of Adobe Acrobat Pro for Windows (11.0.10) as a trial version.

    When I convert a file via the menu File > Create > PDF from File, a dialog box appears.

    There is no mention of MS Word (.doc, .docx) files anywhere in the list of supported file types.

    However, Excel and PowerPoint are in the list.

    If I go to the batch create tool and manually drag an MS Word document onto the palette, it is added to the list, but when I click OK to start the batch conversion, it fails to convert.

    I am running Windows 7 and have Office 2013 installed with all the latest service packs and patches applied.

    I tried doing a repair of Microsoft Office, but that did not fix it.

    Uninstalling and reinstalling Acrobat Pro several times did not fix it either.

    I will definitely not buy this software if it does not recognize Word, since the reason I was trying it out was to have a way to bulk convert a lot of Word documents rather than having to open each one in Word itself and save it as PDF.

    Kind regards

    Gray.

    Hello

    Thanks for the information. I guess that's causing the problem.

    Can you please try changing the 'Default' value of the registry key to 'Word.Document.8', and let us know if your problem is solved.

    Thank you

    Tanvi

  • Why does ODC convert JPG images to PDF files when checking in content?

    Hello

    I use ODC 10gR3 32-bit, and my problem is that when I check in a native JPG file to WebCenter Content 11g (11.1.1.6), the image is converted to PDF. When I log in to the WebCenter Content console and search for the content, both the native and the web-viewable files are PDF files.

    What should I do so that my native file keeps its original extension (JPG)?

    The web-viewable file is OK as PDF, so no changes are needed for that one.

    Regards,
    Carlos

    ODC does not work with the concepts of native and web-viewable files. Instead, you have input files and output file(s), whose formats may be the same or different. Note that ODC mainly uses the output of a scanner as its input. ODC will process these inputs for you (reads bar codes, separates the pages into a batch of documents, recognizes or allows manual entry of metadata, categorization, etc.) and finally commits the results to the selected repository (it can even be a file system or a database!). You can retain the original format or change it. An often-used option converts image formats such as TIFF to 'Searchable PDF' (a PDF with a text layer, which can be indexed for full-text search; OCR is used to get the text layer).

  • Cannot convert Web Page to PDF: file not found

    Hello

    I can't convert some open web pages to PDF documents:

    I select the option "Convert Web Page to PDF", after which a dialog box opens asking where I want to save, etc. A file name is already suggested. If I click Save, I get an error message saying that the file was not found and to check the file name and try again.

    I have tried changing the file name and the save location, but get the same result.

    Any ideas, please?

    Acrobat Pro DC, running in Windows 10, Internet Explorer 11

    Microsoft, in their wisdom, made sure the Convert function in Acrobat will not work unless you disable some of the web security in Internet Explorer on Windows 10.  In Internet Explorer, click Tools > Internet Options > Security and uncheck Enable Protected Mode.  You must restart Internet Explorer.

    This isn't Adobe's fault.  But maybe they could find a way to make their product work in this environment without turning off this feature in Internet Explorer on Windows 10.  Just saying, as I have another PDF program that works very well even with the Enable Protected Mode box checked.  Hey, at least Microsoft included IE in Windows 10, since there is no way yet to use add-ons in Edge.

  • Convert blog articles to PDF files

    I want to convert some articles from my blog (http://www.tiertier.com/blog) to PDF files and share the PDFs on SlideShare or other document-sharing communities. Which product should I use? The PDF cloud service?

    Hi Sam,

    You need to use Acrobat for that.

    You can download a free 30-day trial of Acrobat: Download Adobe Acrobat products. Standard, Pro | DC, XI, X

    Kind regards

    Rahul

  • Is there any way to convert a massive number of PDF files to Word documents without having to go through each and every one of them?

    Adobe PDF does a wonderful job of converting a PDF file to a Word document. However, I would like to know if there is any way to convert my many PDF files to Word without having to go through each and every one of them, clicking and confirming prompts for the conversion. For example, there is an option to convert many files to PDF simply by checking a box to indicate the ones I want converted. However, I don't see this option when converting a PDF to Word format. I need to go through each one individually and confirm various things such as the destination folder. It would really save me some time, because I wouldn't need to sit in front of my laptop clicking manually.

    Hi chaconne1003,

    You can do this by creating an Export PDF action in Acrobat. However, Acrobat would prompt you several times during the process, usually for the location and format of the resulting file, so it isn't really an automated process.

    However, if you have a subscription to the online Export PDF service, all you need to do is select all the files to be converted at once, set the conversion option to Microsoft Word, and then you can sit back and watch the files being uploaded and automatically converted.

    Once all the file conversions are complete, you will be prompted to download the converted files. However, it can also be a tedious process if the PDF files being uploaded are large or you are on a slow network.

    I hope this helps.

    Kind regards

    Rahul

  • How to convert a Word document to a PDF file?

    How do I convert a Word document into a .pdf file?

    Hi wertheca,

    If you have Acrobat (full version), you can convert a file to PDF by choosing File > Create > PDF from File. Choose your Word document, and then click Open to convert the file to PDF format.

    If you do not have Acrobat, you can use a subscription to Acrobat's online services to convert almost any file format into a PDF file with a simple click or two. For more information, see more Acrobat, PDF package, PDF Export & more | Documents Acrobat solutions

    I hope this helps.

    Best,

    Sara

  • Acrobat 9 batch: get the names of the PDF files in a folder?

    Hello

    Can someone give me an example of using a batch sequence (given a specific folder of files) to get a list of the names of the files in that folder? I was not able to find examples online. There does not seem to be any built-in batch sequence that does this automatically, so I guess it will have to be done with JavaScript?

    Ultimately, I need to automate a process (I was hoping to create a batch sequence to do this) which will prompt a user to choose a source folder of PDF files and then, based on this selection, run a JavaScript that I create, which will merge all of the PDF files in that folder into a new PDF, apply some crop settings to each page, and then ask the user where to save the merged PDF file.

    Has anyone ever done something like this? I'd love to see an example of how this could be achieved.

    Thanks in advance.

    Have you tried creating a batch process (now called 'Actions') to test?  Actions in Acrobat 10 have a 'Merge all the files in the folder' option, and batch processing in previous versions has always included a crop pages command, as well as an option to ask the user where to save the file.

    There are many examples and articles on this topic at http://www.acrobatusers.com

    Thom Parker
    The source for PDF Scripting Info
    pdfscripting.com

    The Acrobat JavaScript Reference, use it early and often

    The most important JavaScript development tool in Acrobat
    The Console window (video tutorial)
    The Console Window (article)

  • Office Jet 8500 scans a document, converts it, and saves the PDF file, but the PDF file does not open

    My new Office Jet 8500 (A910-G) seems to correctly perform a scan to PDF, but the PDF file does not open. It gives me the error message "the selected document cannot be opened" in PDF Complete.

    I agree that it's a software problem, because the file works well with Adobe Acrobat.  It's just that PDF Complete (which came free with my HP computer) does not work!

  • How to convert XML into a PDF file?

    I use Adobe Acrobat X Pro 10.0.  I saved a PDF file as an XML 1.0 document.  I would like to be able to send this XML document and have the recipient open the XML as a PDF file.  Does anyone know how to do this?

    Thank you

    Hello hilaryb49990629,

    Thanks very much for posting on the Adobe forums.

    I'm sorry, but XML is not a file type that Adobe Acrobat/Reader can open. It can be opened in a browser or a text editor (such as Notepad) to view its content.

    Kind regards

    Ana Maria
