PDF: Difference between revisions

[pending revision][accepted revision]
Content deleted Content added
m grammar
m Reverting possible vandalism by 182.239.164.240 to version by Annh07. Report False Positive? Thanks, ClueBot NG. (4335440) (Bot)
(22 intermediate revisions by 16 users not shown)
Line 58:
In April 2023 the PDF Association made ISO 32000-2 available for download free of charge.<ref name=nowfree>{{cite press release| title=Announcing no-cost access to the latest PDF standard: ISO 32000-2 (PDF 2.0)| publisher=PDF Association| url=https://pdfa.org/sponsored-standards| date=16 June 2023| orig-date=Updated; originally published 5 April 2023| access-date=October 6, 2023| archive-date=September 23, 2023| archive-url=https://web.archive.org/web/20230923202322/https://pdfa.org/sponsored-standards/| url-status=live}}</ref>
 
== {{Anchor|Technical foundations}}Technical details ==
 
A PDF file is often a combination of [[vector graphics]], text, and [[bitmap graphics]]. The basic types of content in a PDF are:
Line 91:
Its disadvantages are:
 
* LossA loss of flexibility, and limitation to a single use case.{{citation needed|date=December 2023}}
* A (sometimes much) larger file size.<ref>{{cite web |last1=Anton Ertl |first1=Martin |title=What is the PDF format good for? |url=https://www.complang.tuwien.ac.at/anton/why-not-pdf.html |website=www.complang.tuwien.ac.at |publisher=Vienna University of Technology |access-date=8 April 2024|archive-url=https://web.archive.org/web/20240404031526/https://www.complang.tuwien.ac.at/anton/why-not-pdf.html|archive-date=4 April 2024|url-status=live}}</ref>
 
PDF since v1.6 supports embedding of interactive 3D documents: 3D drawings can be embedded using [[U3D]] or [[PRC (file format)|PRC]] and various other data formats.<ref name="3d#1">{{cite web |url=https://www.adobe.com/manufacturing/resources/3dformats/ |title=3D supported formats |publisher=Adobe Systems Inc. |date=July 14, 2009 |access-date=February 21, 2010
Line 130:
There are two layouts to the PDF files: non-linearized (not "optimized") and linearized ("optimized"). Non-linearized PDF files can be smaller than their linear counterparts, though they are slower to access because portions of the data required to assemble pages of the document are scattered throughout the PDF file. Linearized PDF files (also called "optimized" or "web optimized" PDF files) are constructed in a manner that enables them to be read in a Web browser plugin without waiting for the entire file to download, since all objects required for the first page to display are optimally organized at the start of the file.<ref name="pdf-ref">{{cite web|url=https://www.adobe.com/devnet/pdf/pdf_reference.html|title=Adobe Developer Connection: PDF Reference and Adobe Extensions to the PDF Specification|publisher=Adobe Systems Inc.|access-date=December 13, 2010|url-status=dead|archive-url=https://web.archive.org/web/20061115132507/https://www.adobe.com/devnet/pdf/pdf_reference.html|archive-date=November 15, 2006}}</ref> PDF files may be optimized using [[Adobe Acrobat]] software or [[QPDF]].
 
Page dimensions are not limited by the format itself. However, Adobe Acrobat imposes a limit of 15 million by 15 million inches, or 225 trillion in<sup>2</sup> (145,161&nbsp;km<sup>2</sup>).<ref name="pdf-ref-1.7" />{{rp|1129}}
 
== Imaging model ==
Line 194:
== Additional features ==
 
=== Logical structure and accessibility<span class="anchor" id="Tagged PDF"></span> ===
{{See also|PDF/A-1|PDF/UA}}
 
A "'''tagged" PDF'''<!--boldface per [[WP:R#PLA]]--> (see clause 14.8 in ISO 32000) includes document structure and semantics information to enable reliable text extraction and [[accessibility]].<ref>{{cite web |title=Tagged PDF Best Practice Guide: Syntax |url=https://pdfa.org/wp-content/uploads/2019/06/TaggedPDFBestPracticeGuideSyntax.pdf |website=pdfa.org |publisher=[[PDF Association]] |date=June 2019 |access-date=2024-06-24}}</ref> Technically speaking, tagged PDF is a stylized use of the format that builds on the logical structure framework introduced in PDF 1.3. Tagged PDF defines a set of standard structure types and attributes that allow page content (text, graphics, and images) to be extracted and reused for other purposes.<ref>{{cite web|first=Duff|last=Johnson|date=April 22, 2004|title=What is Tagged PDF?|url=https://www.talkingpdf.org/what-is-tagged-pdf/|url-status=live|archive-url=https://web.archive.org/web/20040807132851/http://www.planetpdf.com/enterprise/article.asp?ContentID=6067|archive-date=August 7, 2004}}</ref>
 
Tagged PDF is not required in situations where a PDF file is intended only for print. Since the feature is optional, and since the rules for Taggedtagged PDF were relatively vague in ISO 32000-1, support for tagged PDF among consuming devices, including [[assistive technology]] (AT), is uneven as of 2021.<ref>{{Cite web|title=Is PDF accessible?|website=DO-IT - Disabilities, Opportunities, Internetworking, and Technology|publisher=University of Washington|date=October 4, 2022|url=https://www.washington.edu/doit/pdf-accessible?1002=|access-date=January 12, 2023|archive-date=February 10, 2023|archive-url=https://web.archive.org/web/20230210114239/https://www.washington.edu/doit/pdf-accessible?1002=|url-status=live}}</ref> ISO 32000-2, however, includes an improved discussion of tagged PDF which is anticipated to facilitate further adoption.
 
An ISO-standardized subset of PDF specifically targeted at accessibility, [[PDF/UA]], was first published in 2012.
Line 309 ⟶ 310:
There are many software options for creating PDFs, including the PDF printing capabilities built into [[macOS]], [[iOS]],<ref>{{Cite web|url=https://ijunkie.com/how-to-create-pdf-web-page-safari-iphone-ipad-ios-11/|title=How to Create a PDF from Web Page on iPhone and iPad in iOS 11|last=Pathak|first=Khamosh|date=October 7, 2017|website=iJunkie|access-date=January 12, 2023|archive-date=January 12, 2023|archive-url=https://web.archive.org/web/20230112153246/https://ijunkie.com/how-to-create-pdf-web-page-safari-iphone-ipad-ios-11/|url-status=live}}</ref> and most [[Linux]] distributions. Much document processing software including [[LibreOffice]], [[Microsoft Office 2007]] (if updated to [[Office 2007#Service Pack 2|SP2]]) and later,<ref>{{cite web|url=http://support.microsoft.com/kb/953195|title=Description of 2007 Microsoft Office Suite Service Pack 2 (SP2)|publisher=[[Microsoft]]|url-status=dead|archive-url=https://web.archive.org/web/20090429212434/http://support.microsoft.com/kb/953195|archive-date=April 29, 2009|access-date=January 12, 2023}}</ref> [[WordPerfect]] 9, and [[Scribus]] can export documents in PDF format. There are many PDF print drivers for Microsoft Windows, the [[pdfTeX]] typesetting system, the [[DocBook]] PDF tools, applications developed around [[Ghostscript]] and [[Adobe Acrobat]] itself as well as [[Adobe InDesign]], [[Adobe FrameMaker]], Adobe Illustrator, Adobe Photoshop, that allow a "PDF printer" to be set up, which when selected sends output to a PDF file instead of a physical printer. [[Google]]'s online office suite [[Google Docs]] allows uploading and saving to PDF. Some web apps offer free PDF editing and annotation tools.
 
The [[Free Software Foundation]] werewas "developing a free, high-quality and fully functional set of libraries and programs that implement the PDF file format and associated technologies to the ISO 32000 standard", as one of theirits [[High priority free software projects|high priority projects]].<ref>On 2014-04-02, a note dated February 10, 2009 referred to [http://www.fsf.org/campaigns/priority.html Current FSF High Priority Free Software Projects] {{Webarchive|url=https://web.archive.org/web/20070810230457/http://www.fsf.org/campaigns/priority.html |date=August 10, 2007 }} as a source. Content of the latter page, however, changes over time.</ref><ref>{{cite web|url=http://gnupdf.org/Goals_and_Motivations|title=Goals and Motivations|publisher=GNUpdf|date=November 28, 2007|website=gnupdf.org|access-date=April 2, 2014|archive-date=July 4, 2014|archive-url=https://web.archive.org/web/20140704114405/http://www.gnupdf.org/Goals_and_Motivations|url-status=live}}</ref> In 2011, however, the GNU PDF project was removed from the list of "high priority projects" due to the maturation of the [[Poppler (software)|Poppler library]],<ref>{{cite web|title=GNU PDF project leaves FSF High Priority Projects list; mission complete!|url=http://www.fsf.org/blogs/community/gnu-pdf-project-leaves-high-priority-projects-list-mission-complete|date=October 6, 2011|first=Matt|last=Lee|publisher=Free Software Foundation|website=fsf.org|archive-date=December 28, 2014|archive-url=https://web.archive.org/web/20141228050435/http://www.fsf.org/blogs/community/gnu-pdf-project-leaves-high-priority-projects-list-mission-complete|url-status=live}}</ref> which has enjoyed wider use in applications such as [[Evince]] with the [[GNOME]] desktop environment. Poppler is based on [[Xpdf]]<ref>{{cite web|url=http://poppler.freedesktop.org/|title=Poppler Homepage|quote=Poppler is a PDF rendering library based on the xpdf-3.0 code base.|access-date=January 12, 2023|archive-date=January 8, 2015|archive-url=https://web.archive.org/web/20150108235708/http://poppler.freedesktop.org/|url-status=live}}</ref><ref>{{cite web|url=http://cgit.freedesktop.org/poppler/poppler/tree/README-XPDF|title=Xpdf License|quote=Xpdf is licensed under the GNU General Public License (GPL), version 2 or 3.|access-date=January 12, 2023|archive-date=April 14, 2013|archive-url=https://archive.today/20130414194348/http://cgit.freedesktop.org/poppler/poppler/tree/README-XPDF|url-status=live}}</ref> code base. There are also commercial development libraries available as listed in [[List of PDF software]].
 
The [[Apache PDFBox]] project of the [[Apache Software Foundation]] is an open source Java library, licensed under the [[Apache License]], for working with PDF documents.<ref>{{cite web|url=http://pdfbox.apache.org/|url-status=live|title=The Apache PDFBox project- Apache PDFBox 3.0.0 released|date=August 17, 2023|archive-date=January 7, 2023|archive-url=https://web.archive.org/web/20230107234923/https://pdfbox.apache.org/}} Updated for new releases.</ref>
Line 317 ⟶ 318:
[[Raster image processor]]s (RIPs) are used to convert PDF files into a [[raster graphics|raster format]] suitable for imaging onto paper and other media in printers, digital production presses and [[prepress]] in a process known as [[rasterization]]. RIPs capable of processing PDF directly include the Adobe PDF Print Engine<ref>{{cite web|url=https://www.adobe.com/products/pdfprintengine/overview.html|title=Adobe PDF Print Engine|publisher=Adobe Systems Inc.|access-date=August 20, 2014|archive-date=August 22, 2013|archive-url=https://web.archive.org/web/20130822034446/http://www.adobe.com/products/pdfprintengine/overview.html|url-status=live}}</ref> from Adobe Systems and Jaws<ref>{{cite web|url=http://www.globalgraphics.com/products/jaws_rip/|title=Jaws® 3.0 PDF and PostScript RIP SDK|work=globalgraphics.com|access-date=November 26, 2010|archive-date=March 5, 2016|archive-url=https://web.archive.org/web/20160305090728/http://globalgraphics.com/products/jaws_rip|url-status=dead}}</ref> and the [[Harlequin RIP]] from [[Global Graphics]].
 
In 1993, the Jaws raster image processor from Global Graphics became the first shipping prepress RIP that interpreted PDF natively without conversion to another format. The company released an upgrade to theirits Harlequin RIP with the same capability in 1997.<ref>{{cite web |url = http://www.globalgraphics.com/products/harlequin-multi-rip |title=Harlequin MultiRIP|access-date=March 2, 2014|url-status=dead|archive-url=https://web.archive.org/web/20140209215413/http://www.globalgraphics.com/products/harlequin-multi-rip/|archive-date=February 9, 2014 }}</ref>
 
[[Agfa-Gevaert]] introduced and shipped Apogee, the first prepress workflow system based on PDF, in 1997.