Thamme Gowda

Researcher at @Microsoft | Previously: @ISI-NLP, @Apache, @USCDataScience @NASA

Projects

marian-dev

Fast Neural Machine Translation in C++ - development repository

C++ - Released: 03 May 2016 - 255

mtdata

A tool that locates, downloads, and extracts machine translation corpora

Python - Released: 06 Apr 2020 - 139

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Python - Released: 27 May 2019 - 1,116

nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API

Python - Released: 27 Jul 2022 - 172

infinibatch

Efficient, check-pointed data loading for deep learning with massive data sets.

Python - Released: 27 Jun 2020 - 203

tensorflow-grpc-java

Tensorflow grpc java client for image recognition serving inception model

Java - Released: 28 Jun 2016 - 39

notes

A place to dump all my homeworks and practice scribblings.

Jupyter Notebook - Released: 14 Sep 2015 - 4

charliebot

The ALICE/ALICEBOT/CHARLIE/CHARLIEBOT

Java - Released: 21 Jan 2015 - 18

tika-ner-corenlp

Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser

Java - Released: 31 Oct 2015 - 13

tika-dl4j-spark-imgrec

Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika

Java - Released: 14 May 2017 - 14

awkg

awkg is an awk-like text-processing tool powered by python language

Python - Released: 22 Jul 2019 - 4

006-many-to-eng

Machine translation of many to English

Jupyter Notebook - Released: 06 Jun 2020 - 9

autoextractor

A toolkit for clustering web pages based on various similarity measures.

Java - Released: 25 Dec 2015 - 4

005-nmt-imbalance

Finding the Optimal Vocabulary for NMT

Jupyter Notebook - Released: 27 May 2020 - 6

unmass

Unsupervised NMT based on Masked Seq-to-Seq

Python - Released: 15 Jul 2020 - 2

summary

Research Summaries

CSS - Released: 08 Jun 2019 - 1

011-imb-learn

Imbalanced Learning

Python - Released: 24 Apr 2021 - 1