PyTorch Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"
MIT License
This repository contains an unofficial implementation of the paper "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs" by Sachin Kumar and Yulia Tsvetkov.
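The vmf loss option trains the decoder to output a vector ê whose direction and norm parameterize a von Mises-Fisher distribution over the unit sphere, and minimizes the negative log-likelihood of the target word's pretrained embedding e(w) under that distribution. The NumPy/SciPy sketch below only evaluates the loss value for intuition; it assumes --reg1 adds a penalty λ1‖ê‖ and --reg2 rescales the dot-product term (the paper's two regularization variants), and it is not the repository's differentiable implementation, which works with an approximation of the log normalizer log C_m.

```python
# Illustrative NumPy/SciPy sketch of the NLLvMF loss value (not the repo's
# differentiable implementation, which approximates log C_m).
import numpy as np
from scipy.special import ive  # exponentially scaled modified Bessel function I_v

def log_vmf_normalizer(kappa, m):
    """log C_m(kappa) for the vMF density on the unit sphere in R^m."""
    v = m / 2.0 - 1.0
    # log I_v(kappa) = log(ive(v, kappa)) + kappa, since ive is I_v scaled by exp(-kappa)
    log_bessel = np.log(ive(v, kappa)) + kappa
    return v * np.log(kappa) - (m / 2.0) * np.log(2 * np.pi) - log_bessel

def nll_vmf(e_hat, e_target, reg1=1e-3, reg2=0.1):
    """-log C_m(||e_hat||) + reg1 * ||e_hat|| - reg2 * (e_hat . e_target).
    Combining both regularizers here mirrors the two CLI flags; this is an assumption."""
    kappa = np.linalg.norm(e_hat)
    m = e_hat.shape[-1]
    return -log_vmf_normalizer(kappa, m) + reg1 * kappa - reg2 * (e_hat @ e_target)

rng = np.random.default_rng(0)
e_hat = rng.normal(size=300)              # unnormalized decoder output
e_tgt = rng.normal(size=300)
e_tgt /= np.linalg.norm(e_tgt)            # target word embedding on the unit sphere
print(nll_vmf(e_hat, e_tgt))
```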
IWSLT data
bash scripts/get_data.sh
bash scripts/tokenize.sh
bash scripts/bpeize.sh
Word alignment (only needed for cross-entropy training)
bash scripts/align_data.sh
python3 dicts_from_alignment.py --datasets de-en,en-fr,fr-en
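The alignments feed dicts_from_alignment.py, which builds per-language-pair dictionaries for the cross-entropy setup. As a purely illustrative reading of that step, the sketch below parses alignments in the standard Pharaoh "i-j" format (what fast_align emits) and keeps, for each source word, its most frequently aligned target word; the file names and the dictionary format are assumptions, not necessarily what the script actually does.

```python
# Illustrative sketch: build a source->target dictionary from Pharaoh-format
# alignments ("0-0 1-2 ..."). File names and the resulting dictionary format
# are assumptions, not necessarily what dicts_from_alignment.py produces.
from collections import Counter, defaultdict

def build_dict(src_path, tgt_path, align_path):
    counts = defaultdict(Counter)
    with open(src_path) as fs, open(tgt_path) as ft, open(align_path) as fa:
        for src_line, tgt_line, align_line in zip(fs, ft, fa):
            src, tgt = src_line.split(), tgt_line.split()
            for pair in align_line.split():
                i, j = map(int, pair.split("-"))
                if i < len(src) and j < len(tgt):
                    counts[src[i]][tgt[j]] += 1
    # keep the most frequently aligned target word for each source word
    return {s: c.most_common(1)[0][0] for s, c in counts.items()}

lexicon = build_dict("train.de", "train.en", "train.align")  # hypothetical paths
print(list(lexicon.items())[:5])
```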
WMT data for embeddings
bash scripts/get_data_wmt.sh
bash scripts/tokenize_wmt.sh
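The WMT corpora are only used to pre-train the target-side embeddings selected with --emb-type and loaded from --emb-dir. A minimal gensim sketch is below; the corpus path, hyperparameters, and output layout are illustrative assumptions rather than what the repository's scripts actually produce.

```python
# Illustrative sketch: train word2vec embeddings on the tokenized WMT text with
# gensim. Paths and hyperparameters are assumptions; the repository's own
# embedding-training setup may differ (e.g. fasttext for --emb-type fasttext).
from gensim.models import Word2Vec

model = Word2Vec(
    corpus_file="data/wmt/train.tok.en",  # hypothetical path to tokenized WMT text
    vector_size=300,                      # embedding dimensionality
    window=5,
    min_count=5,
    sg=1,                                 # skip-gram
    workers=4,
)
model.wv.save_word2vec_format("embeddings/en.w2v.txt")  # hypothetical --emb-dir layout
```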
Training
python3 train.py --dataset de-en --token-type word --loss vmf --emb-type w2v --tied --reg1 1e-3 --reg2 0.1
Options:
--dataset {de-en,en-fr,fr-en}
--token-type {word,bpe,word_bpe}
--loss {xent,l2,cosine,maxmarg,vmfapprox_paper,vmfapprox_fixed,vmf} (the continuous losses are sketched below the list)
--batch-size BATCH_SIZE
--num-epoch NUM_EPOCH
--lr LR
--emb-type {w2v,fasttext}
--emb-dir EMB_DIR
--device-id DEVICE_ID
--reg1 REG1
--reg2 REG2
--tied
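Aside from vmf (sketched above) and the vmfapprox variants, the continuous --loss options reduce to short expressions over the predicted vector ê and the target embedding e(w). Below is a minimal PyTorch sketch of l2, cosine, and a single-negative maxmarg; the margin value and the use of one random negative per example are illustrative simplifications, not necessarily how the repository implements max-margin.

```python
# Illustrative PyTorch sketch of the simpler continuous losses. The margin and
# the negative-sampling scheme are assumptions; the repo's maxmarg may differ.
import torch
import torch.nn.functional as F

def l2_loss(e_hat, e_tgt):
    return ((e_hat - e_tgt) ** 2).sum(dim=-1).mean()

def cosine_loss(e_hat, e_tgt):
    return (1.0 - F.cosine_similarity(e_hat, e_tgt, dim=-1)).mean()

def max_margin_loss(e_hat, e_tgt, e_neg, margin=0.5):
    # hinge on cosine similarity against a negative word embedding
    pos = F.cosine_similarity(e_hat, e_tgt, dim=-1)
    neg = F.cosine_similarity(e_hat, e_neg, dim=-1)
    return torch.clamp(margin - pos + neg, min=0.0).mean()

e_hat = torch.randn(8, 300)                       # batch of decoder outputs
e_tgt = F.normalize(torch.randn(8, 300), dim=-1)  # target embeddings
e_neg = F.normalize(torch.randn(8, 300), dim=-1)  # randomly sampled negatives
print(l2_loss(e_hat, e_tgt), cosine_loss(e_hat, e_tgt), max_margin_loss(e_hat, e_tgt, e_neg))
```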
Decoding
python3 decode.py --dataset de-en --token-type word --loss vmf --emb-type w2v --batch-size 2048 --tied --reg1 1e-3 --reg2 0.1 --eval-checkpoint all
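Decoded hypotheses can then be scored with BLEU. The small sacrebleu sketch below uses placeholder file names, since the actual output paths written by decode.py are not shown here.

```python
# Illustrative sketch: score decoded output with sacrebleu.
# The file names are placeholders; decode.py's actual output paths may differ.
import sacrebleu

with open("decoded.en") as f:          # hypothetical hypothesis file
    hyps = [line.strip() for line in f]
with open("test.en") as f:             # hypothetical reference file
    refs = [line.strip() for line in f]

bleu = sacrebleu.corpus_bleu(hyps, [refs])
print(bleu.score)
```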
To run all 39 experiments with a single command:
bash run_all.sh