How can I correct a text 'hidden' in a searchable PDF file?

This seems to be a simple question. However, the answers are always complex, do not give the desired result and often answering a different question altogether. I say all this just to warn people at the outset that the 'problem' is easier than how many people and PDF application developers, including Adobe, generally it include the proposed "solutions" are invariably a total... Well, botch a reasonable word if somewhat underestimated.

Here's the real problem:

I have "searchable" PDF files created by scanning documents and their implementation through an OCR process. I create "searchable" PDF files to archive, index and possibly allow search for scanned documents. A "searchable" PDF file meets these criteria better than any other commonly used, 'portable' archive format - although I'd be happy if someone could indicate an obvious alternative I can remember. I don't have a perfect OCR results. If I need a document to edit or maybe feed in a spreadsheet or a database, I expect to be able to reprocess the page images in a given "searchable" OCR PDF and convert the content Word, RTF, Excel or another file format as needed with more carefully to the results for the archived document itself. Therefore, the "searchable" PDF document is scanned page images making it up while the OCR generated 'text' is secondary, but still important. Therefore, each file must contain images of the scanned page in sufficient detail to be efficiently converted by OCR if possible and quite readable for people who consider the images to be able to establish a process of OCR can not understand. Once scanned, these pages are the 'document' and so 'immutable '. However, OCR is not perfect. For an archive of searchable documents, it doesn't have to be, but some mistakes are important because they can prevent the document to be found by a search. Therefore, there must be a way to view and, if necessary, change the text 'hidden' in a "searchable" PDF file without changing the view of a document or how it is printed. No redaction. Not visible "corrections." None of the PDF stuff publishers want to insert in a PDF file when you edit it. I don't want to modify the document without exporting to a format suitable for an editable document. I just want enough 'correct' text hidden in a PDF "searchable".

I apologize for the length and redundancy in my description of the problem. However, past attempts to explain my problem and objectives as well as what I've seen in response to similar questions through the Internet indicate that most people trying to answer this question coming home the same point of view shared by most, if not all, sellers PDF tool or application. They seem to think that all desire to edit a PDF file is a desire to have a PDF processing any. Or, they assume that the process of OCR employee may need tweaking of the means by which people apply and then a process like 'find the suspects' is sufficient to deal with errors. But no, it's not what I'm trying to accomplish and the responses that deal with these subjects do not meet this issue.

In short, what tool or the application of any provider will reveal "searchable" hidden text in a file PDF produced by any OCR or any other process and then enable corrections to the text hidden without changing the document display settings at all? Note, includes hidden text generally limit the information in the box which indicates the portion of the picture that the text has been recognized. This information must be lost or changed when you change the text "available".

Thus the tools or applications that can do that? If Adobe Acrobat Pro XI can (use of a copy of the trial showed that hidden text content may be revised, but publishing has failed by any simple way I work while trying the application), fine. However, list $ 500.00 or even a $200.00 can update a copy of Adobe Acrobat X Standard that came with my scanner is a lot of money for personal use when reviewing and changing the hidden text in a PDF "searchable" OCR is the only function I need. Therefore, other instruments suggested or applications that make what I need for less would be greatly appreciated.

I scan a document directly or starting with a PDF "only images", no Adobe product offers a convenient way to examine and correct the 'hidden' text generated by either the process Adobe Acrobat OCR OCR tools or third-party applications. When you use Adobe Acrobat, the 'Find the Suspects' function will allow correction of the flags of Acrobat OCR results, but a lot of mistakes-for example the names are a particular problem because incorrectly recognized the outcome of the names of "viewable" files that could not be found when looking for files containing the text elements that are among the most likely entered search terms - will not be reported as "suspect" and cannot be corrected in a file to be saved as a file PDF 'searchable' where 'searchable' is Image indexable () 'Exact)"- that is a PDF file containing images of scanned documents original, no altered page - with the"searchable"text written in the file as text 'hidden' 'under' the images on the page.

For scanning and archiving paper documents with text 'hidden' to enable computerized research, products from other suppliers should be used. Adobe Acrobat products are insufficient to the task.

If you want to scan documents, treat them with OCR, and then convert the results to other formats - for example Word, RTF, etc. - to be edited by hand, then Adobe products can be used. However, just at how they perform this task is outside my experience and is beyond the scope of my original question.

Tags: Acrobat

Similar Questions

  • How can I change my fill of tru for the pdf file and the sign? It will never change.

    How can I change my fill of tru for the pdf file and the sign? It will never change.

    In Adobe fill & sign you can add text, checkmarks, an X, a point and a circle and sign documents.  It does not "change" the existing content of the document.  Here is a tutorial: Tutorial: Introduction to Adobe fill & sign

    Thank you

    Josh

  • How can I email page 1 of a 2 page pdf file?

    How can I email page 1 of a 2 page PDF file?

    Hi chovey

    You can do this with the help of Adobe Acrobat,

    Please download the trial version and use it for free for 30 days.

    Please visit: https://www.acrobat.com/free-trial-download.html

    To retrieve the pages: http://helpx.adobe.com/acrobat/using/manipulating-deleting-renumbering-pdf-pages.html

  • How can I get rid of brands lining around a pdf file?

    How can I get rid of brands lining around my pdf? They just came after that the pfd was created.

    Two options: (1) back to your application that created the PDF file and disable them.

    (2) If you have Acrobat Pro, you can use the crop tool to crop away.

  • ENVY 4500 e-All-in-One-Series: 4500 want don't scan text to produce a searchable PDF file

    I used to have a printer Photosmart 4400 C that would allow me to scan to produce searchable PDF files, i.e. I could select letters or words in a text selection tool in my PDF reader.   Today, I just bought a DESIRE 4500 e-All-in-One-Series.  It will scan only text to give me a graphic PDF that is not searchable.  I've updated to the latest version of the EN4500_198.exe driver, but it does not help.

    Does anyone have a solution?

    Hello

    No, the 4500 desire doesn't have any default OCR software, you will have to pay for the software and believe it or not, that the software is more expensive than the printer itself. You can buy the software or you can try to use some free software around:

    http://www.techsupportalert.com/best-free-OCR-software.htm

    Kind regards

  • How can I correct the spelling of the name of the file (or folder)?

    I use Win 7. Accidentally, I mistyped a word of a folder title. I wrote the word with lower case rather than capitalize it. When I right click on the folder name and correct my bad typing the folder name becomes the word is spelled with lower case. Similar things are producing with the file names. In previous versions on Windows, change or correct a folder/file name was a wink. How can I do this in Windows 7? Thanks in advance for any help.

    You actually change the name. What you see is the original buffer that has not been marked "dirty" by the change in the case. If you press F5 , you can force a refresh and the revised spelling should show.

    BTW, tapping F2 initiates an editing session to rename all as right, click Rename or gently second by clicking on a selection.

  • How can I PERMANENTLY change the default printer to Adobe PDF files using my physical printer instead of SIMPLY printing a pdf in a new PDF file?

    Whenever I restart Adobe Reader DC default printer is automatically reset to print any PDF file to the "print to PDF file" as the default printer.  How can I change the "default printer" always default to be my physical default printer, instead of "print of PDF?»»»

    PRINTING a document which is already a pdf file to a pdf file is absolutely ridiculous and stupid for the following reasons:

    1. If you want to save a document that is already a pdf in different PDF file, you can use the function "Save as" in the file menu.
    2. If the user select "Print" to PRINT a file they ther NOT try to SAVE a file.  (We'll give credit to the average user to understand the difference between PRINTING and SAVING of a document).
    3. Most of the users do not need to duplicate the storage of a document on their drives to computer
    4. Users of mobile devices, by the fact that they have already downloaded the PDF file have already used storage space of EXTREMELY valuable and LIMITED memory with the download of origin and most certainly so NOT to use more than this space such a resource limited printing!

    As an engineer, hardware and software for the past 40 + years to really understand the value of making easy allow the end user to change the configuration of something as simple as the setting for the default printing device to use!

    For the record, I searched the online help as well as the forum Adobe Reader community for a thread that explains how to change the printer by default so that whenever I starts the Adobe service it will automatically use the printer default physical connected to my computer to "PRINT" any PDF document.  If I missed something that deals with this problem please point me about it, if no such band rolling exists, then please tell me before making this change.

    Thank you for your help in this matter!

    Sincerely,

    Larry A. Coates

    Hi larryc57256426,

    You can set your default printer following the steps mentioned in this document KB change the default system printer.

    Check in the print dialog box, you have unchecked "print to file" under "Advanced settings."

    Kind regards
    Nicos

  • How can you save a .docx word attachment in a .pdf file in Outlook 2010

    I receive many emails I have attachments in word .doc or .docx format.  I want to be able to "save under" a PDF WITHOUT having to open the document, and to go through all this rigamarole.  There must be a way to "save under" a PDF somewhere in this program...

    Please note that I don't want to save the E-MAIL message as a pdf.   I want to save the attachment, which is a word doc to a pdf file without having to open it!   Tab file, save as saves ONLY the message and not the attachment in pdf format.

    Thank you!

    You must manually convert the files if they are not in the right format, or tell people who send allows them to use PDF.

    The fastest way I can think of is to print documents using the enabling Adobe PDF writer.

  • How can I remove my own pestabished presets to export PDF files?

    Hello

    I created some presets to export pdf files. Now I want to delete some of these presets, and I don't know how to do it.

    I have Illustrator CS4 on mac 10.8.4...

    .

    Mac OS10.7.5, but I imagine it's the same thing...

    Users / [your account] / Library/Application Support/Adobe/Adobe PDF / Settings

    Custom work options should be located there. Just remove the files. You will need to restart apps to see their removal in the menus of the application.

    .

  • How can I get a link to open two different PDF files?

    Even if I'm just the guy to the hardware support where I work, I was invited to maintain the web site. The last demand is that they want to get a link to open two PDFs. I know you can do something like this with web pages, but I don't know if you can do the same with PDF files. I don't know the basics of Dreamweaver, then, if it's possible, please explain it to me as I am has 5-year-old. However, I'm not afraid of the code. Thank you, in advance, for any assistance.

    Hello

    You can do this by using the javascript onClick event to load the 2 files pdf, but if I were to see such action when clicking on a link, which I guess he was trying to open several pages (according to spam sites) and immediately close the tabs/windows and never visit the site again.

    If you do not need two pdf files then merge two PDFs in one and a link to the file as usual.

    PZ

  • On my Imac, how can I do preview my default application to open PDF files?

    bold text

    Hi brent1234,

    You should take a look at article in the Knowledge Base Panel Applications - define how Firefox handles different types of files. It will show you how to associate files with programs that should open types.

    Hope this helps!

  • How can 7520 all-in-one, I scan to a pdf file?

    I just got a 7520 all-in-one and want to scan documents to pdf format, but the only choices I'm given are all images. I'm doing something wrong?

    hstn9,

    Welcome to the Forum of discussion Forum HP printer.

    Install the Photosmart software 7520 full functionality on your computer to the printer.

    General instructions to install the printer software

    • Open drivers HP & downloads
    • Enter your printer model information
    • Select your printer in the list of the 'results '.
    • Enter your operating system from the menu drop-down
    • Click NEXT and scroll down
    • Find the category driver - software product installation
    • Select the base driver, e-print, or a full features software
    • Save the *.exe installation package (s) on your computer

    The files will probably save in the "folder"downloads. "

    Next:

    • If it is available, you can download print and scan doctor and / or other programs of the category-utilities

    TIP:

    Install the utilities / tools first. bit installation packages are smaller in size

    ·

    • If you have control of "Admin", you can highlight the package and "double-click" to install it, otherwise just right-click, select run as administrator and install.

    NOTES:

    ·  On the bottom half of the main Web page for your printer, you can find useful to help with the installation videos.

    ·  Scroll through the list of videos back until you find the video that best fits your situation.

    ·         If you use a USB connection: download and install the driver software before connecting a USB cable.

    ·         Ethernet: If the printer supports Ethernet, you can connect the printer to the network (and assign the printer an IP address for the router, if you wish).  Once the printer is connected to the network, make sure that the printer is turned on, and then install the printer software.

    • Once the initial installation is complete and functional, don't forget to check and install the relevant updates - check the category - updated

    There is valuable information about the Web site, including manuals, how-to pages and alerts for your printer.  Be sure to take a comprehensive look at what's available.  To bookmark the page.

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

    Start the wizard printer (program for the centre of alternatives):

    Windows key + R (open the command run) >

    Paste this line:

    'C:\Program HP HP Photosmart 7520 7520 Photosmart series.exe series\Bin\HP' - UDCDevicePage to start

    Opens the Printer Wizard > select "scan a Photo Document or ' > select 'Scan to File."

    The chart in the example:

    At the bottom of the "HP Scan" menu, you'd like to put in place a shortcut customized for your scan options - the shortcut contains the parameters for the analysis you select and then appear at the bottom of the list of available scan options.

    Create Shortcut Wizard for the printer on your desktop:

    • Right click on a clear view of your desktop > new > shortcut
    • Select the location (command) for your new shortcut:

    'C:\Program HP HP Photosmart 7520 7520 Photosmart series.exe series\Bin\HP' - UDCDevicePage to start

    • Type a name for the shortcut, for example, 7520 Assistant

    NOTES:

    -Argument from the beginning at the end of the command line keeps the wizard from the ePrint on your browser Web page.

    Once the new shortcut is created, you can right click on it > properties > change icon

    When you select "Change icon" you might see an error message indicating there is no icon for the shortcut - continue on and select from your personal collection or choose an icon from those provided by the system.

    ==========================================================================================

    Click on the star of congratulations !

    It's a nice way of saying 'Thank you' for help.

    Although I strive to reflect best practices of HP, I do not work for HP.

  • How can I make Adobe XI the default program for PDF files in Windows version 1511 10?

    I have Adobe Acrobat X! Standard (11.0.13).  I have recently upgraded to Windows 10 version 1511.  After the upgrade, it turned my default pdf for Microsoft Edge program.  I would like to change the default value to Adobe Acrobat Standard XI.  I tried to open a PDF with Adobe and Adobe followed automatically make my default program Adobe.  However, when I selected OK, I received an error message Adobe claiming that couldn't make Adobe default program.  Any suggestions?

    Hi johnv1955,

    Right-click on the PDF file, select open with-> choose default program, select Acrobat XI & confirm the changes.

    Kind regards
    Nicos

  • How can I get Adobe Acrobat Reader DC for open PDF files in their own separate windows?

    I find it interesting that in support of multiple monitors becomes more standard in operating systems, applications start requiring all documents related in the same window, such that sometimes you have to use Registry hacks to get documents in separate windows to be able to easily compare side-by-side all multiple monitors.

    First it was Microsoft Excel... now, it seems that Adobe Reader required documents in a single window.

    Although Adobe Reader at least allows you to click on the "Window" menu and choose "New window" to "move" a PDF file in a separate window (quite annoying rather than actually move the tab selected in the new window, it opens another copy, which requires that you always close the first copy in the original window). I prefer the default behavior to automatically open documents in a new window when I double click on it in Windows Explorer to open them.  At least give me a preference option for this...

    Does anyone have solutions of (permanent) workaround for this?  I know there are changes in registry for Excel (that I put in place)... I hope it's easier for Adobe Reader.

    Thank you

    Murray

    Hello

    Please navigate to Edit > preferences > General > uncheck "open documents as new tabs in the same window(requires restart). > once unchecked please re-launch Reader.

    You would not be able to see the documents in separate windows.

    Concerning

    Sukrit diallo

  • How can I change the text in the Messages to the vertical?

    How can I change the text in the Messages to the vertical?

    If you mean change the orientation, simply turn the phone. If this does not work, close Messages and then run again.

    To close the Messages, press the Home button twice quickly. You will see small glimpses of your applications recently used. Drag to the left to find the application you want to close. Swipe up on the preview of the application to close.

    If it does not, the strength to restart the phone. No data is affected by this. To force the reboot your device, press and hold the two buttons of sleep/wake and home for at least ten seconds, until you see the Apple logo.

Maybe you are looking for

  • Need to check the update of the BIOS Portege R830 without Intel TXT

    Hello Forum, I have exactly the same problem as described here 2 years ago:http://forums.computers.Toshiba-Europe.com/forums/thread.jspa?threadID=65519 I need to update my BIOS, but I can't because the update of the BIOS told me that the Intel TXT is

  • Tecra S10 - 10F - need Server 2003 or 2008 LAN driver

    I'm looking for a driver lan for Toshiba S10 - 10F for Windows Server 2003 or 2008.Or at least, I would like to know the model of network card. Thank you

  • graphic problem double y510p

    Hello. I have two graphics cards: -integrated: intel graphics HD 4600 -dedicated: nvidia geforce gt 750 m I recently reinstalled sytem and tried to pass windows 10. I know that pilots are not compatible, so I reinstalled windows 8.1, installed latest

  • than graphical

    Tengo a lost, exporto UN programa, the graphical o aplicativo cuando varia me of position, decir if Cree el programa donde pantalla del computador are convex Tower, cuando lo instale in another PC con pantalla mas los objetos the pantalla ancha is de

  • Visa read write on port series independently and at the same time!

    Hi all! I need to read and write data to and from a serial port (rs232) independently and simultaneously . I'm sorry that I can not put a picture of that, but it's very simple: I used "set up the serial port" and forwarded to two while loops, the res