Search Results

Search found 7251 results on 291 pages for 'pdf parsing'.

Page 4/291 | < Previous Page | 1 2 3 4 5 6 7 8 9 10 11 12 | Next Page >

How to turn a pdf into a text searchable pdf?

- by don.joey

I have a number of scanned documents in pdf and I want to be able to search them. How can I do that? Essentially I have to OCR the pdf and then blend the extracted text back into a new pdf. I have unsuccesfully tried pdfocr (which gives me this issue: https://github.com/gkovacs/pdfocr/issues/7) pdfsandwich (of which the software center says it…

Read the article
Convert PDF to PNG using ImageMagick

- by StackOverflowNewbie

using ImageMagick, what command should i use to convert a PDF to PNG? I need highest quality, smallest file size. this is what I have so far (very slow by the way): convert -density 300 -depth 8 -quality 85 a.pdf a.png Looking at what Gmail does when a user "view" a PDF, the quality is awesome and the file size very minimal. The DPI is just 96…

Read the article
Silent Printing of PDF From Within Java

- by Paul Reiners

We are looking into silent printing of PDF documents from within Java. The printing will be invoked from the desktop and not through a browser so we cannot use JavaScript. PDF Renderer is an operational solution but their rendering quality is not acceptable. iText does not seem to be pluggable with the Java print service. There are some…

Read the article
Parsing a string, Grammar file.

- by defn

How would I separate the below string into its parts. What I need to separate is each < Word including the angle brackets from the rest of the string. So in the below case I would end up with several strings 1. "I have to break up with you because " 2. "< reason " (without the spaces) 3. " . But Let's still " 4. "< disclaimer " 5.…

Read the article
Parsing tab delimited file with double quotes in Perl

- by sfactor

I have a data set that is tab delimited with the user-agent strings in double quotes. I need to parse each of these columns and based on the answer of my other post I used the Text::CSV module. 94410634 0 GET "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.6; .NET CLR 2.0.50727; .NET CLR 3.0.04506.648; .NET CLR…

Read the article
Perl: parsing string enclosed by double quotes

- by sfactor

I need to parse tab/space delimited files that have a lot of columns in Perl. The values are such that the there are large strings enclosed within double quotes. These strings can have any characters such as tabs and spaces or anything else. When I try to parse them with the split function it splits these strings as well. Now how can I…

Read the article
Subscription service or software to handle a Magazine's PDF

- by Paolo

I'm looking for an installable or hosted software (service) to handle the process of public users subscribing to the Magazine and receiving the PDF automatically upon an admin upload the new one. The system will have to: handle the money part (PayPal&Co. are OK) let user buy old issues of the Magazine warn user on subscription…

Read the article
Looking for .NET library to create PDF

- by aximili

We are looking for a .NET PDF creator. It needs to be .NET, so we can just copy the file(s) onto the server, not having to install anything. We only need to create a PDF with some text and images and a heading, that's all. Anyone know a good one? We are happy to buy if there is a good one that is easy to use. Thanks in advance.

Read the article
How to embed an image in a PDF using cfpdfform

- by JGrimm

I'm dynamically generating a PDF with a few variables but also need to be able to embed an image on the PDF. Anyone have any experience doing this using ?

Read the article
Print/save full webpage as PDF

- by Oliver

I need a method to be able to print/save the current full webpage as a PDF. I know it can be done if I download a PDF printer and print to that; but I need it to be done without the user having to do anything other than click a button in a webpage. I can't do it via PHP as the page is all client side content, so I'm guessing an…

Read the article
Bloated PDF created by TCPDF

- by Yogi Yang 007

In a web app developed in PHP we are generating Quotations and Invoices (which are very simple and of single page) using TCPDF lib. The lib is working just great but it seems to generate very large PDF files. For example in our case it is generating PDF files as large as 4 MB (+/- a few KB). How to reduce this bloating of PDF…

Read the article
Cropping a PDF File's Margin During Printing

- by JavaMan

I'm using the free Acrobat Reader to print out some pdf documents having very large top/bottom/left/right margins. I want to remove the margins (which are wasting too much space and making the fonts too small). I used to use Acrobat (the paid version having edit features) to crop the src pdf file manually. But since it is an…

Read the article
Populating PDF Fields using FDFACX

- by NWilliams

I was recently asked to preform some updates to an existing PDF document. The changes required were completed using Adobe Designer (the only tool I have available to me). These changes included alignment, and new text. Note that there were fillable form fields on the forms, and they were left untouched. The saved version of…

Read the article
Creating a new Pdf by Merging Pdf documents using TCPDF [php]

- by LuRsT

How can I create a new document using other pdfs that I'm generating? I have methods to create some documents, and I want to merge them all in a big pdf, how can I do that with TCPDF? I do not want to use other libs.

Read the article
Remove or hide PDF layer using ABCPdf?

- by Junior Developer

Is is possible to remove or hide a layer from a PDF using ABCPdf or another framework?

Read the article
Mutating PDF editable fields programatically

- by Chris

Out of tons of questions and answers here about manipulating PDF's with PHP, but none of them seem to fit my requirement. Programmatically, I want to be able to update the content of editable fields. Preferably with PHP. If it matters, the PDF files will be initially hand crafted (as sort of 'template' files that will be…

Read the article
Generate HTML To PDF Control for the .NET application

- by Karan

Has anyone used any open source or paid .NET Control which does the conversion job from html to pdf file? At the moment, i am using Winnovative convertor control. But it has a performance limitation during the generation of bulk pages (like more than 1000) in the pdf. The limitation comes when we use bigger images in the…

Read the article
Converting MS Word Documents to PDF in ASP.NET

- by glaxaco

Similar questions have been asked, but nothing exactly like mine, so here goes. We have a collection of Microsoft Word documents on an ASP.NET web server with merge fields whose values are filled in as a result of user form submissions. After the field merge, the server must convert the document to PDF and stream it…

Read the article
Generation PDF from HTML (component for .NET)

- by Mio18

Can you please point me to open source or a reasonably priced comercial product capable of generating PDF from HTML?

Read the article
Okular can't read pdf files

- by hoang anh Nguyen

I recently have installed Okular on my Ubuntu 14.04. The problem is when I open pdf files, okular gives me the error "Can not find a plugin which is able to handle the document being passed." When I ran Okular by Terminal, this is the message I get. okular(14100)/kdeui (KIconLoader): Error: standard icon theme…

Read the article
Language parsing to find important words

- by Matt Huggins

I'm looking for some input and theory on how to approach a lexical topic. Let's say I have a collection of strings, which may just be one sentence or potentially multiple sentences. I'd like to parse these strings to and rip out the most important words, perhaps with a score that denotes how likely the word is…

Read the article
Parsing mathematical experssions with two values that have parenthesis and minus signs

- by user45921

I'm trying to parse equations like these which only has two values or the square root of a certain value from a text file: 100+100 -100-100 -(100)+(-100) sqrt(100) by the minues signs, parenthesis and the operator symbol in the middle and the square root, and I have no idea how to start off... I've got the…

Read the article
Parsing a website's source

- by Davlog

I want to create an application and maybe upload it to the play store but I am not sure if that what my app does is legal or not. I am downloading a page's source from a website to get some information I need. For example if I download a page about the song "Ramble On" by Led Zeppelin and parse this page…

Read the article
PDF Text Extraction Approach Using OCR

- by Jon

Has anybody attempted to extract text from a PDF using an OCR library and Java? What did you find to be the most reliable library for text extraction. Most of the approaches I've seen (tesseract, GOCR) are C libraries that would require some JNI code to be written. I'm familiar with pdfbox, which is…

Read the article
View a pdf with quick webview though apache proxy

- by Musa

I have a site(IIS) that is accessed via a proxy in apache(on an IBM i). This site serves PDFs which has quick web view and if I access a pdf directly from the IIS server the PDFs starts to display immediately but if I go through the proxy I have to wait until the entire pdf downloads before I can view…

Read the article

< Previous Page | 1 2 3 4 5 6 7 8 9 10 11 12 | Next Page >