[bug] Incorrect import of PDF file

Hi,

I have tried to import a PDF file into LibreOffice Draw, but the result is
not correct. The arrows that the PDF contains are out of place and the whole
page is a mess. The PDF is displayed correctly both in Okular and in Adobe
Acrobat Reader under Windows. I have included two screenshots: one that shows
the bug in LibreOffice and one that shows Okular displaying the file correctly.
I have also attached the PDF document which causes this. The affected page is
page 3.

Cheers,
Raffaele

Hi,

Thanks for your report.

I have tried to import a pdf file into LibreOffice draw, but the result is
not correct.

This is a well-known problem:

http://de.openoffice.org/issues/show_bug.cgi?id=94306

Stefan

Which version of PDF import are you using? I just updated to 1.0.4
(http://extensions.services.openoffice.org/en/project/pdfimport) and the
results are pretty good with the ga.pdf listed in that bug:
http://www.openoffice.org/nonav/issues/showattachment.cgi/56834/GA.pdf

Screenshots here (all on linux):

1. OpenOffice.org Ubuntu'ized 3.2.1.4
http://img806.imageshack.us/img806/4209/screenshotgapdfopenoffi.png

$ cat /usr/lib/openoffice/program/versionrc
[Version]
AllLanguages=en-US
BuildVersion=ooo-build 3.2.1.4, Ubuntu package 1:3.2.1-7ubuntu1.1
buildid=320m19(Build:9505)
ExtensionUpdateURL=http://updateext.services.openoffice.org/ProductUpdateService/check.Update
OOOBaseVersion=3.2
ProductBuildid=9505
ProductMajor=320
ProductMinor=19
ProductSource=OOO320
UpdateID=OpenOffice.org_3_en-US
UpdateURL=
UpdateUserAgent=<PRODUCT> (${buildid}; ${_OS}; ${_ARCH}; BundledLanguages=${AllLanguages})
Vendor=Debian and Ubuntu

2. OpenOffice.org 3.3.0 (standard)
http://img543.imageshack.us/img543/4209/screenshotgapdfopenoffi.png

$ cat /opt/libreoffice/program/versionrc
[Version]
AllLanguages=en-US
buildid=330m19(Build:6)
ExtensionUpdateURL=http://updateexte.libreoffice.org/ExtensionUpdateService/check.Update
OOOBaseVersion=3.3
ProductBuildid=6
ProductMajor=330
ProductMinor=19
ProductSource=OOO330
UpdateID=LibreOffice_3_en-US
UpdateURL=
UpdateUserAgent=<PRODUCT> (${buildid}; ${_OS}; ${_ARCH}; BundledLanguages=${AllLanguages})

3. LibreOffice 3.3.0
http://img708.imageshack.us/img708/7186/screenshotgapdflibreoff.png

$ cat /opt/libreoffice/program/versionrc
[Version]
AllLanguages=en-US
buildid=330m19(Build:6)
ExtensionUpdateURL=http://updateexte.libreoffice.org/ExtensionUpdateService/check.Update
OOOBaseVersion=3.3
ProductBuildid=6
ProductMajor=330
ProductMinor=19
ProductSource=OOO330
UpdateID=LibreOffice_3_en-US
UpdateURL=
UpdateUserAgent=<PRODUCT> (${buildid}; ${_OS}; ${_ARCH}; BundledLanguages=${AllLanguages})
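
When comparing several installations like this, the same fields can be pulled
out of versionrc with a short script instead of reading the dumps by eye. The
following is only a minimal Python sketch, assuming versionrc is a plain
INI-style file with a [Version] section as in the listings above; the helper
name read_build_info and the choice of fields are made up for illustration.

```python
import configparser


def read_build_info(text):
    """Return selected fields from the [Version] section of a versionrc file."""
    # interpolation=None keeps literal values such as ${buildid} untouched.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)
    version = parser["Version"]
    # configparser lowercases option names, so these lookups are case-insensitive.
    return {
        "base_version": version.get("OOOBaseVersion"),
        "build_id": version.get("buildid"),
        "update_id": version.get("UpdateID"),
    }


# Sample data copied from the LibreOffice 3.3.0 listing above (shortened).
sample = """\
[Version]
AllLanguages=en-US
buildid=330m19(Build:6)
OOOBaseVersion=3.3
UpdateID=LibreOffice_3_en-US
UpdateURL=
"""

info = read_build_info(sample)
print(info)
```

To use it on a real installation, pass in the contents of the file, for
example the text read from /opt/libreoffice/program/versionrc, with the path
adjusted to wherever the suite is installed.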

I've not had time to test on WinXP or Win7 today.

Screenshots & PDF attachments won't appear on the list. Post your
screenshots to something like imageshack:
http://imageshack.us/
Post the PDF to a server that will take documents, or send it to me
directly & I'll be happy to test.

I cannot update the pdfimport extension from the link you gave me. I
downloaded the extension for Linux x86 and opened it, and a message box
appeared saying that "the extension does not work on this computer".

Here is the screenshot of the pdf showing in LibreOffice:
http://img97.imageshack.us/img97/5357/pdfinlibreoffice.png
Here it is in Okular (as it should be):
http://img683.imageshack.us/img683/7338/pdfinokular.png
Here is the pdf: http://www.box.net/shared/pxiql9ux87

Hi,

BTW, what's your name, NoOp?

http://de.openoffice.org/issues/show_bug.cgi?id=94306

Which version of PDF import are you using? I just updated to 1.0.4
(http://extensions.services.openoffice.org/en/project/pdfimport)

Ok. LibreOffice 3.3 comes with PDF Import 1.0.3, but I also just
updated to 1.0.4 and I still encounter lots of problems on my Ubuntu
Linux machine.

and the
results are pretty good with the ga.pdf listed in that bug:
http://www.openoffice.org/nonav/issues/showattachment.cgi/56834/GA.pdf

Try the other bugdocs. ;-)

For example:

http://www.openoffice.org/nonav/issues/showattachment.cgi/56838/materialliste.pdf

Stefan

Hi Raffaele,

Here is the screenshot of the pdf showing in LibreOffice:
http://img97.imageshack.us/img97/5357/pdfinlibreoffice.png
Here is it in okular (as it should be):
http://img683.imageshack.us/img683/7338/pdfinokular.png
Here is the pdf: http://www.box.net/shared/pxiql9ux87

I can confirm the problem with LO 3.3 and PDF Import Extension 1.0.4 on
Ubuntu Linux 10.10.

The problems are similar to the examples in issue 94306.

Stefan

...
I see what you are referring to now. The issue is (I think) the vector
objects, which, as Stefan pointed out, have been an issue for quite some
time. I see the same with the other PDF Stefan pointed to:
<http://www.openoffice.org/nonav/issues/showattachment.cgi/56838/materialliste.pdf>

The import filter does add a disclaimer:
http://extensions.services.openoffice.org/en/project/pdfimport
<quote>
Not supported:

    * Native PDF forms
    * Proper paragraphs
    * Processing layout of LaTeX PDF
    * Import of complex vector graphics elements
    * Conversion of tables
    * Import of EPS graphics
    * RTL (right-to-left) text/font support
</quote>
I know that doesn't help... but apparently it is what it is.

I have WinXP on a virtual machine & it has an old copy of Adobe Acrobat
6.0 on it. I opened the Lezione 1.pdf as well as the materialliste.pdf
in it. On both the problem areas are not 'graphics' but instead vector
objects. Even with Acrobat, when I move one of the green arrows (page
3), it moves the outline first & then the background of the object -
they are not grouped. So even there some added work is needed to modify
them. They are, however, rendered properly in Acrobat.

The reason why you are seeing the proper rendering in Okular and/or
Adobe Reader is that those readers only have to draw the PDF's vector
data, whereas the import has to convert it into editable objects. The
same issue occurs with EPS (see the "EPS images in ODF documents" thread
over on the 'discuss' list); you see a thumbnail rendition in Writer, but
printing to PDF renders the EPS properly.

Sorry, there's not much else I can help with.

Gary

I have the same problem as you with PDF import. What matters most to me
is that it doesn't support superscript!

Hi again,

<quote>
Not supported:

    * Native PDF forms
    * Proper paragraphs
    * Processing layout of LaTeX PDF
    * Import of complex vector graphics elements
    * Conversion of tables
    * Import of EPS graphics
    * RTL (right-to-left) text/font support
</quote>
I know that doesn't help... but apparently it is what it is.

Does this disclaimer also cover the problem that in many cases you
get every single character as a separate object? I don't think so.

This makes it practically impossible to just do a minor rephrasing
of the text, which would be a very common task:

http://www.openoffice.org/nonav/issues/showattachment.cgi/56838/materialliste.pdf
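
To illustrate what "each character as a separate object" means for editing,
here is a rough Python sketch of the regrouping that would be needed before
the text can be rephrased. It assumes each imported character is available as
an (x, y, text) tuple with y increasing down the page; that data model, the
tolerance value and the function name merge_characters are hypothetical and
not part of the PDF Import extension.

```python
def merge_characters(chars, y_tolerance=1.0):
    """Merge (x, y, text) fragments into one string per text line."""
    lines = []  # each entry is [reference_y, [(x, text), ...]]
    # Visit fragments in ascending y, then x, so one line's pieces are adjacent.
    for x, y, text in sorted(chars, key=lambda c: (c[1], c[0])):
        if lines and abs(y - lines[-1][0]) <= y_tolerance:
            lines[-1][1].append((x, text))  # same baseline: extend the line
        else:
            lines.append([y, [(x, text)]])  # new baseline: start a new line
    # Within each line, order fragments left to right and join them.
    return ["".join(text for _, text in sorted(frags)) for _, frags in lines]


# Four single-character objects on two slightly uneven baselines.
fragments = [(16, 99.8, "i"), (10, 100.0, "H"), (10, 120.0, "o"), (16, 120.0, "k")]
merged = merge_characters(fragments)
print(merged)
```

A real fix would also have to deal with word spacing, font changes and rotated
text, which this sketch ignores.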

Stefan

If LibrO is based on OOo, why does LibrO have these problems? My problem is using PDF files with LibrO, as noted above.

I DO appreciate the efforts of the LibrO free developers, but can anyone fix these basic problems?

My complaint is not with all of the problems cited here. However, what about PDF compatibility?
This is critical.

--
Unsubscribe instructions: E-mail to users+help@libreoffice.org
List archive: http://listarchives.libreoffice.org/www/users/
*** All posts to this list are publicly archived for eternity ***

Glenn
glennst01@gmail.com

Hi Glenn,

If LibrO is based on OOo why does LibrO have these problems?

You will encounter the same problems with OOo.

These problems are related neither to LibreOffice nor to OOo. PDF Import
is an extension from Oracle. You should direct your complaints to
the author of the extension, which is Oracle.

My complaint is not with all of the problems cited here. However,
what about PDF compatibility? This is critical.

PDF compatibility of OOo/LO is excellent. You can create PDF files
perfectly well.

The PDF file format is, by its definition, not meant to be opened for
re-editing. IMO the PDF import function should never have been
introduced to OOo/LO at all.

Stefan

I, for one, appreciate the PDF import thingy. But I'd say it is more of a tool for changing small bits than anything else. One shouldn't use it for mass changes.

One option I find *very* interesting is the hybrid PDF format. It allows me to distribute files in PDF format for everyone, yet at the same time allow the few who want to edit them to do so without having to post two different file formats.

Hybrid? How do you access that?
Regards from
Tom :-)

Hi Tom,

Hybrid? How do you access that?

You need to have the PDF-Import Extension installed. Once you've done this, you can choose to save in the "hybrid" format, which includes the PDF and the .odt version of your file.

HTH

Sigrid

Does this disclaimer also cover the problem that in many cases you
get every single character as a separate object? I don't think so.

Stefan, I wasn't defending the extension's issues... I filed the issue of
not being able to import EPS graphics back in 2008:
http://www.openoffice.org/issues/show_bug.cgi?id=90739
[* Summary: PDF Import not importing eps graphic]

I also don't use the extension for anything serious - such as filling
out forms:
<http://www.mail-archive.com/users@libreoffice.org/msg01162.html>

I was merely pointing out that the extension has limitations.

This makes it practically impossible to just do a minor rephrasing
of the text, which would be a very common task:

http://www.openoffice.org/nonav/issues/showattachment.cgi/56838/materialliste.pdf

No argument there. As I said "but apparently it is what it is."

File a bug and/or:
http://extensions.services.openoffice.org/en/project/pdfimport
"To further improve this extension, please continue sending us feedback
at dev@graphics.openoffice.org."

Hi,

One option I find *very* interesting is the hybrid PDF format.

I agree 100%.

But the naming of the PDF import extension and the way it has been
promoted by OOo marketing make people believe that they can just
open any PDF file for editing. This creates disappointment, which
IMO is bad.

Stefan

Hi,

Stefan, I wasn't defending the extension's issues... I filed the issue of
not being able to import EPS graphics back in 2008:
http://www.openoffice.org/issues/show_bug.cgi?id=90739
[* Summary: PDF Import not importing eps graphic]

I was merely pointing out that the extension has limitations.

This makes it practically impossible to just do a minor rephrasing
of the text, which would be a very common task:

http://www.openoffice.org/nonav/issues/showattachment.cgi/56838/materialliste.pdf

No argument there. As I said "but apparently it is what it is."

File a bug and/or:
http://extensions.services.openoffice.org/en/project/pdfimport
"To further improve this extension, please continue sending us feedback
at dev@graphics.openoffice.org."

Well, I *did* file the bug
(http://de.openoffice.org/issues/show_bug.cgi?id=94306).

;-)

Stefan