Maarten van Gompel

Research software engineer - NLP - AI - 🐧 Linux & open-source enthusiast - 🐍 Python/ 🌊C/C++ / 🦀 Rust / 🐚 Shell - 🔐 InfoSec - https://git.sr.ht/~proycon

Projects

folia-rust

FoLiA library for rust (alpha)

Rust - Released: 08 Jun 2019 - 4

piereling

Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines.

Python - Released: 18 Oct 2019 - 5

procmapgen

A small toy project written in Rust: procedural generation of various kinds of grid-based maps.

Rust - Released: 05 Aug 2019 - 16

pbmbmt

Phrase-based Memory-based Machine Translation

Python - Released: 25 Oct 2010 - 10

babelente

BabelEnte: Entity Extractor and Translator using BabelFy and Babelnet.org

Python - Released: 13 Oct 2017 - 4

codemeta-server

Server for codemeta, in memory triple store, SPARQL endpoint and simple web-based visualisation for end-user

Python - Released: 25 Mar 2022 - 4

nederlab-pipeline

Linguistic enrichment pipeline for historical dutch, as used in the Nederlab project

Groovy - Released: 14 Jun 2019 - 7

valkuil-gecco

Nederlandse Spellingscontrole / Dutch spelling correction system - powered by Gecco

Python - Released: 11 Sep 2015 - 7

anavec

Proof-of-concept spelling correction/normalisation system based on anagram vectors

Python - Released: 09 Jun 2017 - 6

colibri

THIS PROJECT IS BEING RENDERED OBSOLETE BY NEWER VERSIONS colibri-core and colibri-mt !!

C++ - Released: 08 Oct 2011 - 7

ssam

split sampler: split your data into multiple sets (e.g. train/test/development)

Rust - Released: 04 Sep 2020 - 2

colibri-mt

A Machine Translation framework that wraps around the Moses Decoder and enables k-NN classifier techniques to be used for modelling source-side-context

C++ - Released: 30 Oct 2013 - 5