folia

FoLiA: Format for Linguistic Annotation. FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl; this project contains higher-level tools that use the library, as well as the full documentation, validation schemas, and set definitions.

GPL-3.0 License

Downloads: 2K
Stars: 60
Committers: 5

folia - v1.4.0.53

Published by proycon almost 8 years ago

FoLiA v1.4

  • Migrated FoLiA Set Definitions to RDF using the SKOS model (#14)
  • Deep validation is properly implemented now
    • Added an example document for deep validation
    • Added an example document with deep morphology (Frog output)
  • Various set definitions are provided, most notably those by Frog
    • CGN Part-of-Speech tagset covered now (#18)
  • The number of span roles (of a specific type) allowed under a span annotation can now be constrained
  • Updated documentation
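Deep validation, as the term is used for FoLiA, goes beyond checking document structure and also checks that each class used in an annotation is defined in the corresponding set definition. The sketch below illustrates the idea only, using Python's standard library. The set URL, the `ALLOWED_CLASSES` table, and the `undefined_classes` helper are hypothetical and are not part of the FoLiA tools, which work from real set definitions instead.

```python
import xml.etree.ElementTree as ET

FOLIA_NS = "http://ilk.uvt.nl/folia"

# Hypothetical stand-in for a loaded set definition: set URL -> allowed classes.
ALLOWED_CLASSES = {
    "https://example.org/sets/pos": {"N", "V", "ADJ"},
}

# Assumed sample fragment: two words, each with a part-of-speech annotation.
SAMPLE = f"""<s xmlns="{FOLIA_NS}">
  <w><t>house</t><pos set="https://example.org/sets/pos" class="N"/></w>
  <w><t>quickly</t><pos set="https://example.org/sets/pos" class="ADV"/></w>
</s>"""


def undefined_classes(xml_text, allowed):
    """Return (set, class) pairs whose class is missing from the known set."""
    root = ET.fromstring(xml_text)
    problems = []
    for element in root.iter():
        set_url = element.get("set")
        cls = element.get("class")
        # Only elements that reference a known set can be checked here.
        if set_url in allowed and cls is not None and cls not in allowed[set_url]:
            problems.append((set_url, cls))
    return problems


print(undefined_classes(SAMPLE, ALLOWED_CLASSES))
```

In this sample the class `ADV` is reported because it is absent from the hypothetical table, while `N` passes.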

FoLiA-Tools v1.4.0.53

  • Added foliasetdefinition tool, used to inspect and test set definitions, and to provide conversion from legacy XML sets to RDF SKOS sets
  • foliatextcontent now prints warnings instead of raising NotImplementedError
  • Minor update to foliaspec (issue LanguageMachines/libfolia#8)

folia - v1.3.2.52

Published by proycon about 8 years ago

FoLiA v1.3.2 release, minor update:

  • Allow multiple foreign-data elements in metadata (issue #23)
  • RelaxNG schema fix for foreign-data in metadata
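
As an illustration of the change above, the following sketch parses a metadata block holding two `foreign-data` elements and collects them with Python's standard library. The document skeleton, the version value, and the Dublin Core payload are assumptions made for the example, not content taken from the FoLiA test documents, and `foreign_data_blocks` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

FOLIA_NS = "http://ilk.uvt.nl/folia"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Assumed minimal skeleton; real documents carry more (declarations, text body).
SAMPLE = f"""<FoLiA xmlns="{FOLIA_NS}" xmlns:dc="{DC_NS}" version="1.3.2">
  <metadata>
    <foreign-data><dc:title>Example title</dc:title></foreign-data>
    <foreign-data><dc:creator>Example author</dc:creator></foreign-data>
  </metadata>
</FoLiA>"""


def foreign_data_blocks(xml_text):
    """Return every foreign-data element found directly under metadata."""
    root = ET.fromstring(xml_text)
    path = f"{{{FOLIA_NS}}}metadata/{{{FOLIA_NS}}}foreign-data"
    return root.findall(path)


blocks = foreign_data_blocks(SAMPLE)
# Collect the text of each foreign-namespace child, in document order.
values = [child.text for block in blocks for child in block]
print(len(blocks), values)
```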

folia - v1.3.1.52

Published by proycon about 8 years ago

  • FoLiA v1.3.1 release, minor update over v1.3.0
    • Allow features on CoreferenceChain
    • Allow multiple sense annotations of the same set

folia - v1.3.0.52

Published by proycon about 8 years ago

FoLiA v1.3

  • Added a 'comment' element (higher-order annotation) (issue #10)
  • Added a label attribute on set definition (issue #11)
  • Added a predicate span annotation element that groups semantic roles (issue #13)
  • Added a new linguistic annotation type for various observations on a text (issue #15)
  • Added a new linguistic annotation type for sentiment analysis (issue #16); this is now favoured over the older and more limited subjectivity token annotation element.
  • Added a new linguistic annotation type for attribution (issue #17)

folia - v1.2.0.51

Published by proycon about 8 years ago

FoLiA Tools:

  • Added foliatree tool, which prints the structure of a document.
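
To give a sense of what printing a document's structure involves, here is a rough sketch using only Python's standard library. It is not the foliatree implementation, whose output format is not described here; `structure_lines` is a hypothetical helper and the sample fragment is made up for the example.

```python
import xml.etree.ElementTree as ET

XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def structure_lines(element, depth=0):
    """Yield one indented line per element: local tag name plus xml:id if set."""
    tag = element.tag.split("}", 1)[-1]  # drop the "{namespace}" prefix
    ident = element.get(XML_ID)
    yield "  " * depth + (f"{tag} [{ident}]" if ident else tag)
    for child in element:
        yield from structure_lines(child, depth + 1)


# Assumed sample fragment, not a complete FoLiA document.
SAMPLE = """<text xmlns="http://ilk.uvt.nl/folia" xml:id="doc.text">
  <p xml:id="doc.p.1"><s xml:id="doc.p.1.s.1"><w><t>Hi</t></w></s></p>
</text>"""

print("\n".join(structure_lines(ET.fromstring(SAMPLE))))
```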

folia - v1.2.0.50

Published by proycon over 8 years ago

Minor bugfix release for foliaspec; the previous release was premature.

folia - v1.2.0.49

Published by proycon over 8 years ago

Minor bugfix release for foliaspec.

folia - FoLiA v1.2

Published by proycon over 8 years ago

FoLiA v1.2

  • Support for in-document metadata in any foreign namespace (see section 2.11 of the manual)
  • Support for foreign namespace annotation using the foreign-data element (see section 2.10.15 of the manual)
  • Native FoLiA metadata does not formally predefine any fields (but various tools may)
  • Extended the linebreak element (br) with newpage, pagenr and linenr attributes. The element can now be used to signal page breaks (see section 2.5.6 of the manual)
  • Alignments (alignment) now allow pointing to external documents that are not FoLiA, using the format attribute in combination with the already existing xlink:href attribute (see section 2.10.8 of the manual)
    • aref/@type is now a free field; the type no longer needs to correspond to a FoLiA element. This is necessary to allow linking to external resources.
  • Similarly, references (ref) can now link to external resources using xlink attributes and the format attribute (see section 2.5.12 of the manual)
  • Xlink support has been extended to include attributes such as xlink:role. These are now allowed on all xlink-capable elements.
  • Example document (test/example.xml) has been brought in line with some extra tests conducted by the libfolia library.
  • A small Arabic example document was added (test/example.ar.xml), intended to test right-to-left visualisation.
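
The extended linebreak element can be illustrated with a short sketch. It assumes that a page break is marked with `newpage="yes"` and that `pagenr` and `linenr` hold the numbers of the page and line that start after the break. The sample fragment and the `page_breaks` helper are hypothetical, and section 2.5.6 of the manual remains the authority on the exact semantics.

```python
import xml.etree.ElementTree as ET

FOLIA_NS = "http://ilk.uvt.nl/folia"

# Assumed fragment: one break that starts page 2, one plain line break.
SAMPLE = f"""<p xmlns="{FOLIA_NS}">
  <s><t>Last sentence on page one.</t></s>
  <br newpage="yes" pagenr="2" linenr="1"/>
  <s><t>First sentence on page two.</t></s>
  <br linenr="2"/>
  <s><t>Second line on page two.</t></s>
</p>"""


def page_breaks(xml_text):
    """Return the pagenr value of every br element flagged as a page break."""
    root = ET.fromstring(xml_text)
    return [
        br.get("pagenr")
        for br in root.iter(f"{{{FOLIA_NS}}}br")
        if br.get("newpage") == "yes"  # assumed marker value for a page break
    ]


print(page_breaks(SAMPLE))
```

Only the first `br` is reported; the second is an ordinary line break without the `newpage` attribute.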

FoLiA Tools v1.2.0.48

  • Improvements in folia2html (folia2html.xsl)
    • Support for right-to-left languages (based on explicit metadata field direction: rtl)
    • Explicit support for titles in dublin core format
    • Better propagation of metadata from FoLiA to HTML
    • Visualisation of some token annotation under morphemes
    • Render phonetics in annotation pop-up

folia - v1.0.1.47

Published by proycon over 8 years ago

Bugfix release:

  • Allow more optional attributes on annotation layers (this was broken in v1.0.0, but Frog NER and/or FLAT output them)

folia - FoLiA v1.0 release

Published by proycon over 8 years ago

FoLiA v1.0.0

  • New FoLiA specification in schemas/folia.yml, to be used by external libraries to facilitate implementation and synchronization
  • Minor fixes in documentation
  • Library implementation status updated in documentation
  • RelaxNG schema built from external specification (via pynlpl.formats.folia library)
  • Better legacy support in RelaxNG schema (e.g. the listitem tag)

FoLiA-tools (rev 47)

  • Added foliaspec tool that reads from the external FoLiA specification and updates the library source code for pynlpl and libfolia

folia - v0.12.2.46

Published by proycon over 8 years ago

  • Added ComplexAnnotation support in FoLiA schema

folia - v0.12.2.45

Published by proycon over 8 years ago

Release

Package Rankings
Top 12.6% on Pypi.org
Project Status: Active – The project has reached a stable, usable state and is being actively developed.