Siamese LSTM for Text Classification

Goal

Create a model for text classification problems (including intent detection and sentiment analysis) that requires only a small amount of labeled data.

Architecture

The model uses a regression Siamese Recurrent Network [1] as the distance function for a k-Nearest Neighbor (k-NN) model.

The Siamese network learns to predict a distance value for each pair of sentences in the corpus. A pair with the same label has a target distance of 0, while a pair with different labels has a very large, arbitrarily chosen target distance.
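The pairing scheme described above can be sketched as follows. This is a minimal illustration, not the repository's code: the function name `make_training_pairs` and the constant `DIFFERENT_LABEL_DISTANCE` are hypothetical, and the value 100.0 only stands in for the unspecified large target.

```python
from itertools import combinations

# Hypothetical stand-in for the "very large arbitrary value";
# the actual number used by the project is not stated.
DIFFERENT_LABEL_DISTANCE = 100.0


def make_training_pairs(sentences, labels, far=DIFFERENT_LABEL_DISTANCE):
    """Return (sentence_a, sentence_b, target_distance) for every unordered pair."""
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(sentences), 2):
        # Same label -> target 0; different labels -> large target.
        target = 0.0 if labels[i] == labels[j] else far
        pairs.append((a, b, target))
    return pairs


sentences = ["book a flight", "reserve a plane ticket", "play some jazz"]
labels = ["travel", "travel", "music"]
pairs = make_training_pairs(sentences, labels)
for a, b, target in pairs:
    print(f"{a!r} vs {b!r} -> {target}")
```

Because every unordered pair is generated, n labeled sentences yield n(n-1)/2 training examples, which is one reason a pairwise setup can make use of a small labeled corpus.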

The trained network then serves as the distance function of a k-NN model built on the training data. The k-NN model is evaluated on the test data.
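A rough sketch of this k-NN step is shown below. The names `knn_predict` and `toy_distance` are hypothetical, and `toy_distance` (a Jaccard distance on word sets) is only a placeholder for the trained Siamese network so that the example runs on its own.

```python
from collections import Counter


def knn_predict(query, train_sentences, train_labels, distance_fn, k=3):
    """Label `query` by majority vote among its k nearest training sentences."""
    distances = [
        (distance_fn(query, sentence), label)
        for sentence, label in zip(train_sentences, train_labels)
    ]
    distances.sort(key=lambda item: item[0])
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


def toy_distance(a, b):
    """Placeholder for the trained network: Jaccard distance on word sets."""
    words_a, words_b = set(a.split()), set(b.split())
    return 1.0 - len(words_a & words_b) / len(words_a | words_b)


train_sentences = ["book a flight", "book a hotel", "play some jazz", "play a song"]
train_labels = ["travel", "travel", "music", "music"]
test_sentences = ["book a train", "play some rock"]
test_labels = ["travel", "music"]

# Evaluate the k-NN classifier on held-out sentences.
predictions = [
    knn_predict(s, train_sentences, train_labels, toy_distance, k=3)
    for s in test_sentences
]
accuracy = sum(p == t for p, t in zip(predictions, test_labels)) / len(test_labels)
print(predictions, accuracy)
```

In the real setup, `distance_fn` would call the trained Siamese network on the sentence pair; the rest of the k-NN logic would stay the same.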

Siamese Model Diagram

Current configurations

Embedding layer: uniformly distributed 200-dimensional vectors
Bidirectional LSTMs: 3 layer pairs of sizes 1024, 512, and 256. Activation: tanh
Single-output LSTM: size 256. Activation: sigmoid
Dense layer: size 128. Activation: linear
Merge layer: Mean absolute difference
Loss function: An experimental loss function where:
Batch size: 50
Dropout: 0.1
L2 regularization: 0.03
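To illustrate the merge layer listed above, here is a small sketch of a mean absolute difference between the two branch outputs. The function name `mean_absolute_difference` is hypothetical, and the sketch assumes the merge reduces the two encoded vectors to a single scalar distance; the toy vectors have 4 dimensions rather than the 128 produced by the configured dense layer.

```python
def mean_absolute_difference(left, right):
    """Mean of |left_i - right_i| over two equal-length encoded vectors."""
    if len(left) != len(right):
        raise ValueError("encoded vectors must have the same length")
    return sum(abs(l - r) for l, r in zip(left, right)) / len(left)


# Toy 4-dimensional encodings standing in for the two branch outputs.
encoded_a = [0.5, -1.0, 2.0, 0.0]
encoded_b = [0.5, 1.0, 0.0, 0.0]
distance = mean_absolute_difference(encoded_a, encoded_b)
print(distance)
```

The result is never negative and is 0 only when both encodings are identical, which matches the target of 0 for same-label pairs.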

References

[1] Learning Text Similarity with Siamese Recurrent Networks

GKarmakar/siamese-lstm
