CS6910_Assignment3

##Assignment 3 for CS6910: Fundamentals of Deep Learning

The init_lang function initializes the language model. It takes the input language name and the output language name as input and returns the input and output language objects.
The read_words function reads the words from the file and returns a list of words. Its input is the input language name and the output is a list of pair of input/output words.

Then make two dictionaries, one for the encoder and the other for the decoder. These dictionaries contain the hyperparameters for the encoder and the decoder respectively. Example: encoder_hp = {

'input_size': input_lang.n_letters, 

'embedding_size': 64, 

'hidden_size': 512, 

'num_layers': 2, 

'dropout_p': 0.1,

'type': 'gru',

'bidirectional': False}

decoder_hp = {

'hidden_size': 512, 

'embedding_size': 64, 

'output_size': output_lang.n_letters, 

'num_layers': 2, 

'dropout_p': 0.1,

'type': 'lstm',

'bidirectional': False}

Send the hyperparameters to the encoder and decoder classes along with a bool for attn.
Usage: model = Transliterator(encoder_hp=encoder_hp, decoder_hp=decoder_hp, attn = True)
The model.fit function takes the input and output language objects, the input and output words for both training and validation, the number of epochs, the learning rate, the batch size, the teacher forcing ratio, the optimizer and the loss function as input. It prints the loss and accuracy for both training and validation for each epoch. It stores the model with the best validation accuracy.
Usage: {

model.fit(train_pairs, validation_pairs, input_lang, output_lang, optimizer, n_epochs , learning_rate , teacher_forcing_ratio )

}
The model.predict function takes an input word and the model as input and returns the predicted word and attention heatmap(If attn is set to True).
Usage: { model.predict(input_word) }
The model.eval function takes the input and output words for validation, the model and returns the accuracy and the attention heatmap(if attn is set to True).
Usage: { model.eval(validation_pairs) }
The use_wandb flag can be set as True to use wandb for logging the metrics. Wandb report can be found at: https://wandb.ai/cs20b004/CS6910_Assignment3/reports/CS6910-Assignment-3--Vmlldzo0NDIzMDY1

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
A3.ipynb		A3.ipynb
README.md		README.md
output.png		output.png
predictions_attention.csv		predictions_attention.csv
predictions_vanilla.csv		predictions_vanilla.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS6910_Assignment3

About

Releases

Packages

Languages

Aytien/CS6910_Assignment3

Folders and files

Latest commit

History

Repository files navigation

CS6910_Assignment3

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages