login about faq

I want to be able to read the text content of .pdf links..

I use this to get the html of the links


Chilkat.Http http = new Chilkat.Http(); // Send the HTTP GET and return the content in a string. string html; html = http.QuickGetStr("https://site.com/file.pdf");

Chilkat.Mime mime = new Chilkat.Mime();


string strPdfBody = mime.GetBodyDecoded();

With this I'm getting decoded text, but not the text that is on the .pdf.. Can tell me what I'm doing wrong?


asked Apr 27 '14 at 22:20

hsunjaya's gravatar image


AFAIK no Chilkat libraries can be used to parse out plain text from a PDF file. You will need to use a library specifically designed to get text out of a PDF file for this job. I use a library called QuickPDF personally, but there are others out there.


answered Apr 28 '14 at 21:11

jpbro's gravatar image

jpbro ♦

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here



Answers and Comments

Markdown Basics

  • *italic* or __italic__
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported



Asked: Apr 27 '14 at 22:20

Seen: 702 times

Last updated: Apr 28 '14 at 21:11

powered by OSQA