Search Results

Search found 4479 results on 180 pages for 'pdf scraping'.

Page 15/180 | < Previous Page | 11 12 13 14 15 16 17 18 19 20 21 22 | Next Page >

What is the best (offline)way to convert a big PDF into Word file (excluding images)?

- by metal gear solid

What is the best way to convert a big PDF into Word file (excluding images)? PDF has many images but in converted Word file i don't want those images, means in conversion process i want to exclude images. but i need text in same formatting like PDF. Any online tool will not work because PDF is too big. and i need instant result. don't wan to upload PDF and get in email.

Read the article
What is the best (offline) way to convert a big PDF into Word file (excluding images)?

- by metal gear solid

What is the best way to convert a big PDF into Word file (excluding images)? PDF has many images but in converted Word file i don't want those images, means in conversion process i want to exclude images. but i need text in same formatting like PDF. Any online tool will not work because PDF is too big. and i need instant result. don't wan to upload PDF and get in email.

Read the article
Convert PDF to Image Batch

- by tro

I am working on a solution where I can convert pdf files to images. I am using the following example from codeproject: http://www.codeproject.com/Articles/317700/Convert-a-PDF-into-a-series-of-images-using-Csharp?msg=4134859#xx4134859xx now I tried with the following code to generate from more then 1000 pdf files new images: using Cyotek.GhostScript; using Cyotek.GhostScript.PdfConversion; using System; using System.Collections.Generic; using System.Drawing; using System.IO; using System.Linq; using System.Text; using System.Threading.Tasks; namespace RefClass_PDF2Image { class Program { static void Main(string[] args) { string outputPath = Properties.Settings.Default.outputPath; string pdfPath = Properties.Settings.Default.pdfPath; if (!Directory.Exists(outputPath)) { Console.WriteLine("Der angegebene Pfad " + outputPath + " für den Export wurde nicht gefunden. Bitte ändern Sie den Pfad (outputPath) in der App.Config Datei."); return; } else { Console.WriteLine("Output Pfad: " + outputPath + " gefunden."); } if (!Directory.Exists(pdfPath)) { Console.WriteLine("Der angegebene Pfad " + pdfPath + " zu den PDF Zeichnungen wurde nicht gefunden. Bitte ändern Sie den Pfad (pdfPath) in der App.Config Datei."); return; } else { Console.WriteLine("PDF Pfad: " + pdfPath + " gefunden."); } Pdf2ImageSettings settings = GetPDFSettings(); DateTime start = DateTime.Now; TimeSpan span; Console.WriteLine(""); Console.WriteLine("Extraktion der PDF Zeichnungen wird gestartet: " + start.ToShortTimeString()); Console.WriteLine(""); DirectoryInfo diretoryInfo = new DirectoryInfo(pdfPath); DirectoryInfo[] directories = diretoryInfo.GetDirectories(); Console.WriteLine(""); Console.WriteLine("Es wurden " + directories.Length + " verschiedende Verzeichnisse gefunden."); Console.WriteLine(""); List<string> filenamesPDF = Directory.GetFiles(pdfPath, "*.pdf*", SearchOption.AllDirectories).Select(x => Path.GetFullPath(x)).ToList(); List<string> filenamesOutput = Directory.GetFiles(outputPath, "*.*", SearchOption.AllDirectories).Select(x => Path.GetFullPath(x)).ToList(); Console.WriteLine(""); Console.WriteLine("Es wurden " + filenamesPDF.Count + " verschiedende PDF Zeichnungen gefunden."); Console.WriteLine(""); List<string> newFileNames = new List<string>(); int cutLength = pdfPath.Length; for (int i = 0; i < filenamesPDF.Count; i++) { string temp = filenamesPDF[i].Remove(0, cutLength); temp = outputPath + temp; temp = temp.Replace("pdf", "jpg"); newFileNames.Add(temp); } for (int i = 0; i < filenamesPDF.Count; i++) { FileInfo fi = new FileInfo(newFileNames[i]); if (!fi.Exists) { if (!Directory.Exists(fi.DirectoryName)) { Directory.CreateDirectory(fi.DirectoryName); } Bitmap firstPage = new Pdf2Image(filenamesPDF[i], settings).GetImage(); firstPage.Save(newFileNames[i], System.Drawing.Imaging.ImageFormat.Jpeg); firstPage.Dispose(); } //if (i % 20 == 0) //{ // GC.Collect(); // GC.WaitForPendingFinalizers(); //} } Console.ReadLine(); } private static Pdf2ImageSettings GetPDFSettings() { Pdf2ImageSettings settings; settings = new Pdf2ImageSettings(); settings.AntiAliasMode = AntiAliasMode.Medium; settings.Dpi = 150; settings.GridFitMode = GridFitMode.Topological; settings.ImageFormat = ImageFormat.Png24; settings.TrimMode = PdfTrimMode.CropBox; return settings; } } } unfortunately, I always get in the Pdf2Image.cs an out of memory exception. here the code: public Bitmap GetImage(int pageNumber) { Bitmap result; string workFile; //if (pageNumber < 1 || pageNumber > this.PageCount) // throw new ArgumentException("Page number is out of bounds", "pageNumber"); if (pageNumber < 1) throw new ArgumentException("Page number is out of bounds", "pageNumber"); workFile = Path.GetTempFileName(); try { this.ConvertPdfPageToImage(workFile, pageNumber); using (FileStream stream = new FileStream(workFile, FileMode.Open, FileAccess.Read)) { result = new Bitmap(stream); // --->>> here is the out of memory exception stream.Close(); stream.Dispose(); } } finally { File.Delete(workFile); } return result; } how can I fix that to avoid this exception? thanks for any help, tro

Read the article
Alert Log Scraping with Oracle&#146;s ADRCI Utility

Oracles new ADR with command interface shows promise for database administrators who like to script their own solution for quickly scraping the alert log and automatically looking for errors.

Read the article
How to pretend to be a Printer on iOS like the Apps Save2PDF or Adobe® CreatePDF?

- by Lindemann

I want to convert HTML to PDF on an iOS Device... ...but I dont want to load my HTML in a UIWebView, take a snapshot and generate an ugly PDF from this snapshot picture...because the text must be selectable for my purpose. I wonder how Apps like Save2PDF or Adobe® CreatePDF are able to converting multiple files into PDF and save them. I guess they don't generate the PDF's by their own, but get them from Apples Printing Framework. How does this Apps work?

Read the article
Cannot copy non-latin characters from PDF document

- by user17381

Hi, I have a pdf file which contains some non-latin european characters. If I copy some text with the highlight tool, and paste it into another program (word, notepad) - the 'special' characters do not transfer correctly (I get other odd characters in their place). I have tried copying the text from both Acrobat Reader and Foxit. Is there anything I can do here to copy this? Thanks

Read the article
Split pages of PDF in Adobe Acrobat Pro

- by Lukas

I have a scanned PDF (two vertical pages on one horizonatal page). How can I split them to be single pages in Adobe Acrobat Pro Extended?

Read the article
PDF has garbled text when copy pasting

- by ngm

I'm trying to copy and paste text from a PDF file. However, whenever I paste the original text it is a huge mess of garbled characters. The text looks like the following (this is just one small extract): 4$/)5=$13! ,4&1*%-! )5'$! 1$2$)&,$40! 65))! .*5)1! -#$! )/'8*/8$03! (4/+$6&4;0!/'1!-&&)0!*0$1!.9!/,,)5%/-5&'!1$2$)&,$403!5'!+*%#!-#$! 0/+$!6/9! -#/-! &,$4/-5'8! 090-$+! 1$2$)&,$40! .*5)1!1$25%$! 1452$40! /'1! &-#$4! 090-$+! 0&(-6/4$! %&+,&'$'-0! *0$1! .9! /,,)5%/-5&'! 1$2$)&,$40!-&1/97!"#$!+5M!&(!,4&1*%-!)5'$!/'1!,4&1*%-!1$2$)&,$40! 65))! .$!+*%#!+&4$! $2$')9! ./)/'%$13! #&6$2$43! -#/'! -#$!+5M! &(! &,$4/-5'8!090-$+!/'1!/,,)5%/-5&'!1$2$)&,$40!-&1/97! )*+*+, C<88,?>8513AG<5A14, I've tried it in both Adobe and Foxit PDF readers. I did a 'Save as text' in Adobe Reader and the resultant text file is the same garbled text. Any ideas how I can get this text out non-garbled? (Other than manual typing... there's a lot of text to extract.)

Read the article
Changing ZapfDingbat font character in a PDF file

- by understack

I've got a PDF file which has a single 'ZapfDingbat' font character (a big tick mark) in it. I opened it in Adobe Acrobat Professional and tried to use 'touch up' tool to change this character. But I can't for some reason. How do I go about this?

Read the article
Print a PDF book on line. [closed]

- by microspino

I'd like to print my PDF copy of "why's poignant guide to ruby" to read It on paper before to sleep. I have several open source book I'd like to print too and some of them are full color. I know about lulu.com but I never had any experience with It. Can you give me some advice with real world proofs about on-line-print-and ship-to-your house services?

Read the article
Adjust PDF Contrast with Adobe Reader/Acrobat?

- by wag2639

We get PDF's from our professor to read for homework but they're often scanned documents, is there a way to adjust the contrast of the text to make it easier to read?

Read the article
Adobe Reader: remember password for a PDF file

- by splattne

I've bought some ebooks - password protected PDF files. Since the passwords are very long I wish Adobe Reader could save them for future usage. Does anybody know a solution or workaround for this problem? Thanks!

Read the article
Convert PDF to images

- by matfiz

What's the best free tool to convert PDF to images? I would even prefer to have printer driver able to print to images directly.

Read the article
Printing Large PDF from Outlook 2003

- by mrach

Whenever I try to print an attached oversized PFF sheet (larger then letter sized) from Outlook, the print is cut off. How can I configure Outlook to automatically fit the PDF to page sized with out having to open it up in Adobe Reader?

Read the article
Saving blog items as pdf's

- by ldigas

I know of FireShot, a firefox extension, for saving up whole pages of images. And I love it. Great idea, and a very good implementation. But unfortunatelly, often on this kind of sites, you have links which get lost that way. So I'm wondering, is there a way to save in the same manner whole blog posts, wiki posts, StackOverflow posts :), as PDF files, so the links get saved as well ?

Read the article
Chrome for Linux doesn't display pdf's

- by Grzenio

Hi, I am using latest chrome package for Debian and it doesn't seem to handle acrobat plugin correctly - when I open a pdf there is just nothing on the screen (kind of dark grey colour). How can I configure it to load acrobat correctly?

Read the article
Search PDF files from command line or using scripts

- by arunkumar.cr

Is there a way to search for some text or keyword in PDF files? I need to do it from the command line or by using some scripts.

Read the article
Clear Fillable Online PDF Files

- by May

I filled in an online pdf form. After I finished, I closed the window thinking that the form will clear itself since I didn't save it. When I went back to the website and clicked on the form again, it still had the information that I entered. Other than manually deleting all the information on the form, is there another way of clearing the form?

Read the article
Best way to review pdf documents

- by Anders Rasmussen

I'm looking for an easy way to get my pdf document reviewed. I would prefer an online solution, where I just upload my document and then sent out an url to my reviewers. They can then give comments through the website without any special software installed.

Read the article
PDF has garbled text when copy pasting

- by ngm

I'm trying to copy and paste text from a PDF file. However, whenever I paste the original text it is a huge mess of garbled characters. The text looks like the following (this is just one small extract): 4$/)5=$13! ,4&1*%-! )5'$! 1$2$)&,$40! 65))! .*5)1! -#$! )/'8*/8$03! (4/+$6&4;0!/'1!-&&)0!*0$1!.9!/,,)5%/-5&'!1$2$)&,$403!5'!+*%#!-#$! 0/+$!6/9! -#/-! &,$4/-5'8! 090-$+! 1$2$)&,$40! .*5)1!1$25%$! 1452$40! /'1! &-#$4! 090-$+! 0&(-6/4$! %&+,&'$'-0! *0$1! .9! /,,)5%/-5&'! 1$2$)&,$40!-&1/97!"#$!+5M!&(!,4&1*%-!)5'$!/'1!,4&1*%-!1$2$)&,$40! 65))! .$!+*%#!+&4$! $2$')9! ./)/'%$13! #&6$2$43! -#/'! -#$!+5M! &(! &,$4/-5'8!090-$+!/'1!/,,)5%/-5&'!1$2$)&,$40!-&1/97! )*+*+, C<88,?>8513AG<5A14, I've tried it in both Adobe and Foxit PDF readers. I did a 'Save as text' in Adobe Reader and the resultant text file is the same garbled text. Any ideas how I can get this text out non-garbled? (Other than manual typing... there's a lot of text to extract.)

Read the article
How to convert a really big HTML file to PDF in Windows

- by PeterStrange

We have a few really large HTML files (60-100 MB) that we cannot convert to PDF with any reliability. Adobe Acrobat 9 crashes - hits the 2GB limit for applications. Open Office converts, but removes some of the anchors (). ActivePDF webgrabber crashes. Is using a 64 bit situation an option for this type of thing? I see a bunch of options out there, but can they do better than Adobe Acrobat 9 itself?

Read the article
PDF form (not) saving

- by gregseth

Hi, I've created a form in a PDF with Adobe Acrobat Pro. When empy, I want to use it as a template which the user opens, fills in, and saves as a copy to preserve the blank state of the template. Here's the trick : I found both ways to make the document read only - the user can't save the form value, only print them make the document writeable, but in this case the document acting as a template can be modified too. Any ideas? Thanks.

Read the article
Convert PDF to images (windows)

- by matfiz

Windows. What's the best free tool to convert PDF to images? I would even prefer to have printer driver able to print to images directly.

Read the article
Covert wps file to PDF?

- by Steve Folly

I have a wps file (Microsoft Works), but no Microsoft Works, and no Microsoft Office. Ultimately, I want to get a PDF but happy to convert to an intermediate format if I can. Preferably on OS X if possible, but would use a Windows solution if there is one. Is there a converter tool I can use for this? Thanks.

Read the article
Free PDF Editor for Windows

- by Brett Veenstra

I'd like to edit PDF files while running on Windows. What is the best package to get this for free (either freeware or opensource, not trial/shareware)?

Read the article

< Previous Page | 11 12 13 14 15 16 17 18 19 20 21 22 | Next Page >