On Thu, Jun 09, 2016 at 05:23:02PM -0400, Robert Moskowitz wrote:
On 06/09/2016 03:25 PM, Fred Smith wrote:
On Thu, Jun 09, 2016 at 02:11:26PM -0400, Robert Moskowitz wrote:
On 06/09/2016 02:01 PM, Samuel Sieb wrote:
On 06/09/2016 10:55 AM, Robert Moskowitz wrote:
On 06/09/2016 01:38 PM, Samuel Sieb wrote:
On 06/09/2016 09:34 AM, Robert Moskowitz wrote: >What other tool can read in pdfs and provide selecting an >image (e.g. a >figure in an IEEE standard) that I can then copy over to Libre Office? > Why don't you just use LibreOffice to open the pdf?
When I try opening IEEE 802.1AE-2006 pdf, it hangs. And it is only a 142pg document.
Ok, I've never tried opening one that big. And since that file is not publicly available, I can't test it.
http://standards.ieee.org/getieee802/download/802.1AE-2006.pdf
All 802 standards are available free 6 months after publication. We 802 attendees pay the IEEE for this in our conference attendance fee.
see:
you can use pdfseparate to extract the page you're interested int, then pdfimages to get the images on that page.
with the file you point to, for example, the image from page 86 can be extracted like this:
pdfseparate -f 86 -l 86 8*pdf fred pdfimages -f 1 -l 1 -png fred foo-%d
Yes, that gets fig 12-1, but.
I was able to extract pg 39 for fig 7-7 to a file fred.pdf, but the pdfimages did not create a foo-1 file.
There's something weird about that document, there a number of figures that do not show up in the pdfimages output. here's what it lists for the entire document:
page num type width height color comp bpc enc interp object ID x-ppi y-ppi size ratio -------------------------------------------------------------------------------------------- 1 0 image 459 164 index 1 8 jpx no 2737 0 257 257 9714B 13% 1 1 image 459 164 index 1 8 jpx no 2738 0 257 257 12.9K 18% 1 2 stencil 394 186 - 1 1 ccitt no 2739 0 301 301 436B 4.8% 1 3 stencil 394 184 - 1 1 ccitt no 2740 0 301 300 398B 4.4% 1 4 stencil 387 182 - 1 1 ccitt no 2741 0 301 300 413B 4.7% 1 5 stencil 387 55 - 1 1 ccitt no 2742 0 301 300 61B 2.3% 1 6 stencil 387 116 - 1 1 ccitt no 2743 0 301 300 139B 2.5% 1 7 stencil 387 256 - 1 1 ccitt no 2744 0 301 301 515B 4.2% 1 8 stencil 387 205 - 1 1 ccitt no 2745 0 301 300 220B 2.2% 1 9 stencil 96 53 - 1 1 ccitt no 2734 0 301 301 65B 10% 1 10 stencil 96 63 - 1 1 ccitt no 2735 0 301 300 118B 16% 52 11 image 670 104 index 1 8 jpx no 155 0 179 150 7267B 10% 86 12 image 675 407 index 1 8 jpx no 259 0 120 120 41.3K 15%
so either pdfimages is busted, or some of the figures in that document are stored/created in some unusual way. I don't know enough about PDF internals to have a clue.
Sorry I can't be of more help.
Fred