Researcher at @Microsoft | Previously: @ISI-NLP, @Apache, @USCDataScience @NASA
Fast Neural Machine Translation in C++ - development repository
C++ - Released: 03 May 2016 - 255
A tool that locates, downloads, and extracts machine translation corpora
Python - Released: 06 Apr 2020 - 139
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Python - Released: 27 May 2019 - 1,116
Meta's "No Language Left Behind" models served as web app and REST API
Python - Released: 27 Jul 2022 - 172
Efficient, check-pointed data loading for deep learning with massive data sets.
Python - Released: 27 Jun 2020 - 203
Tensorflow grpc java client for image recognition serving inception model
Java - Released: 28 Jun 2016 - 39
A place to dump all my homeworks and practice scribblings.
Jupyter Notebook - Released: 14 Sep 2015 - 4
Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
Java - Released: 31 Oct 2015 - 13
Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
Java - Released: 14 May 2017 - 14
awkg is an awk-like text-processing tool powered by python language
Python - Released: 22 Jul 2019 - 4
A toolkit for clustering web pages based on various similarity measures.
Java - Released: 25 Dec 2015 - 4
Finding the Optimal Vocabulary for NMT
Jupyter Notebook - Released: 27 May 2020 - 6