Bot releases are visible (Hide)
This is a release with various bug-fixes and quality of life improvements but no new major features. It adds many of the supporting classes necessary for PDF rendering.
IColor
can now be of type PatternColor
. This implementation will throw an error when calling ToRGBValues()
. You might have to check for IColor.ColorSpace != ColorSpace.Pattern
before calling this functionDetails
suffix from ColorSpaceDetails
property namesAlternateColorSpaceDetails
renamed to AlternateColorSpace
BaseColorSpaceDetails
renamed to BaseColorSpace
IColor
implementationsdouble
instead of decimal
in color spaces and colorsIColorSpaceContext
from IOperationContext
to CurrentGraphicsState
ColorSpace
property from IPdfImage
. Use ColorSpaceDetails.Type
to get the enum valueIColorSpaceContext
's CurrentStrokingColorSpace
and CurrentNonStrokingColorSpace
are now of type ColorSpaceDetails
(not a ColorSpace
enum
anymore). Use CurrentStrokingColorSpace.Type
or CurrentNonStrokingColorSpace.Type
to get the enum
valueDefaultWordExtractor
, a logic bug in the existing implementation was fixed, meaning the output of the default page.GetWords()
may change in this versionNote that this version removes support for .NET 4.5. Consumers should upgrade to .NET 4.5.1 or 4.5.2
TextRenderingMode
, StrokeColor
and FillColor
PageSize
enum for landscape orientation documentsCreationDate
and ModifiedDate
are now available in DocumentInformationBuilder
PdfAction
exposed by Annotation
class. InReplyTo
property also addedGetFields
extensions method for AcroForm
typePdfDocumentBuilder
PdfDocumentBuilder
with one or more existing documentsRijndael
and RijndaelManaged
to Aes
since these were marked as obsoletePublished by EliotJones almost 2 years ago
Changes since 0.1.6:
page.SetRotation
for PdfPageBuilder
SkipMissingFonts
to parsing options to ignore content where the font is not present or corrupt. Can result in content being missed during extraction but will enable partial extraction of retrievable content on page for corrupted files.PdfPageBuilder
thanks to @JonowaGrahamScan
thanks to @BobLdDebugger.Break
from the encryption handlerPublished by EliotJones over 2 years ago
Mainly bug fixes. There are some compatibility changes in the document layout analysis API. See here: https://github.com/UglyToad/PdfPig/wiki/Migration-to-0.1.6
Published by EliotJones about 3 years ago
Changes since v0.1.4: https://github.com/UglyToad/PdfPig/compare/v0.1.4...v0.1.5
Published by EliotJones over 3 years ago
Some more bug-fixes:
NullToken
presence when creating documents.IPdfImage
.DefaultWordExtractor
to try and detect word gap size based on preceding text instead of a global gap threshold.Note that changes to DefaultWordExtractor
may change the output of calls to Page.GetWords()
in this version.
Published by EliotJones over 3 years ago
First alpha version of 0.1.5
page.GetOptionalContents()
partial optional content retrieval support.IPdfImage
s.Breaking changes:
PdfDocumentBuilder
now implements IDisposable
. This disposes the underlying stream by default but this is a MemoryStream
normally so not any serious consequences if left undisposed.PdfPageBuilder
had the AdvancedEditing
property removed. The API is now available in the ContentStream
methods / properties (this was from #250).Published by EliotJones almost 4 years ago
PdfDocumentBuilder
. The DrawRectangle
method now takes an optional boolean parameter, fill
.Arial MT
naming.endobj
tokens.Differences
arrays for encodings.Published by EliotJones almost 4 years ago
PointSize
for letters accounting for rotation and other transformationsPublished by EliotJones about 4 years ago
First alpha version of 0.1.3
Published by EliotJones over 4 years ago
Some new features, performance tweaks and improved Document Layout Analysis tools:
PdfDocumentBuilder
, use PdfDocumentBuilder.ArchiveStandard
to select a PDF/A compliance level.PdfPaths
, now PdfSubpath
. Use ParsingOptions.ClipPaths
to enable clipping.PdfMerger.Merge
to generate merged PDFs.IPdfImage
now supports TryGetBytes()
instead of Bytes
. TryGetBytes
returns false
for JPXDecode and DCTDecode image filters for which RawBytes
represent a valid JPEG image.Letter
.TextDirection
is now TextOrientation
, various fixes to the calculations of orientation and bounding box for Word
s.DlaOptions
parameter to specify behaviour.Published by EliotJones over 4 years ago
Published by EliotJones over 4 years ago
Adds letter font details and a couple of other bugfixes to the alpha version.
Published by EliotJones over 4 years ago
First alpha version of 0.1.2
Published by EliotJones over 4 years ago
Many bug fixes for a whole range of document types. In addition:
page.AddJpeg()
.page.GetMarkedContents()
PdfMerger.Merge()
Published by EliotJones over 4 years ago
A whole bunch of bug fixes and other changes.
Published by EliotJones almost 5 years ago
This version focuses on improving performance.
To enable this it replaces decimals with doubles for most of the public API. It also reorganizes the code internally to support access to font related classes.
For this reason consumers will need to update their code, see the migration guide on the wiki.
Other features:
Link
and their text content and destination. Use page.GetHyperlinks()
.document.Advanced.TryGetEmbeddedFiles(out IReadOnlyList<EmbeddedFile> files)
.ParsingOptions.Passwords
to provide the list of passwords. Any password set in ParsingOptions.Password
will be included in the list of passwords.Published by EliotJones almost 5 years ago
Updates the 0.1.0 beta version with many bug fixes.
Published by EliotJones almost 5 years ago
First release which moves internal numerics from decimal
to double
where appropriate.
Reorganises internal project structure.
See migration details in the wiki: https://github.com/UglyToad/PdfPig/wiki/Migration-0.0.X-to-0.1.0
Published by EliotJones almost 5 years ago
This release fixes a major performance regression in 0.10.0.
It also adds bug-fixes for several new issues as well as additional methods for the geometry objects PdfPath
, PdfLine
and PdfRectangle
.
Published by EliotJones almost 5 years ago
This release adds two main new features:
document.TryGetForm(out AcroForm form)
to get the form for the document if it contains one.document.TryGetBookmarks(out Bookmarks bookmarks)
to get the document's bookmarks tree if it contains one.It also aims to improve performance for most content retrieval operations resulting in up to double speed for the smallest documents.
It also adds bug-fixes, structure analysis tools and small improvements:
document.GetPages()
as a convenience method to enumerate all pages in a document.ITextExporter
interface and are used to export each page to a compatible string.page.GetImages()
method enumerates all images on a page, images are either InlineImage
s or XObjectImage
s.page.Text
on certain document types.