This document outlines the configuration parameters for training a sequence-to-sequence model for various tasks such as machine translation, text summarization, and more.
Make sure you have the following libraries installed:
- torch >= 1.0
- pandas
- scikit-learn (needed for the attention heatmap plot with wandb)
- tqdm >= 4.0
- wandb == 0.14.0 (needed if you want to plot the attention heatmap with wandb plots, since the heatmap functionality is deprecated in the latest version)
- argparse (part of the Python standard library; no separate installation needed)
You can install these dependencies using pip:
!pip install wandb==0.14.0 scikit-learn
Embedding Size
Description: Specifies the size of the input embedding vector.

Encoder Layers
Description: Specifies the number of layers in the encoder network.

Decoder Layers
Description: Specifies the number of layers in the decoder network.

Hidden Size : [128, 256, 512, 1024]
Description: Specifies the size of the hidden state in the RNN cells.

Cell Type
Description: Specifies the type of recurrent cell used in the encoder and decoder.

Bidirectional
Description: Specifies whether the encoder is bidirectional.

Batch Size
Description: Specifies the number of training examples in each batch.

Learning Rate
Description: Specifies the learning rate for training the model.

Epochs
Description: Specifies the number of training epochs.

Dropout
Description: Specifies the dropout probability for regularization.

Teacher Forcing
Description: Specifies the probability of using teacher forcing during training.

Attention
Description: Specifies whether the attention mechanism is used in the model.

Mode
Description: Specifies whether to run on the Train and Validation datasets or on the Train and Test datasets.
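The hyperparameters above can be collected from the command line with argparse. The sketch below is a minimal, hypothetical parser: the flag names, defaults, and the RNN/LSTM/GRU cell choices are assumptions for illustration, and the real train.py flags may differ.

```python
import argparse

def build_parser():
    # Hypothetical flag names; the actual train.py arguments may be spelled differently.
    p = argparse.ArgumentParser(description="Seq2seq training configuration (sketch)")
    p.add_argument("--embedding_size", type=int, default=256)   # input embedding vector size
    p.add_argument("--encoder_layers", type=int, default=2)     # layers in the encoder network
    p.add_argument("--decoder_layers", type=int, default=2)     # layers in the decoder network
    p.add_argument("--hidden_size", type=int, default=512,
                   choices=[128, 256, 512, 1024])               # RNN hidden-state size
    p.add_argument("--cell_type", default="LSTM",
                   choices=["RNN", "LSTM", "GRU"])              # recurrent cell type (assumed set)
    p.add_argument("--bidirectional", action="store_true")      # bidirectional encoder
    p.add_argument("--batch_size", type=int, default=64)
    p.add_argument("--learning_rate", type=float, default=1e-3)
    p.add_argument("--epochs", type=int, default=10)
    p.add_argument("--dropout", type=float, default=0.2)        # dropout probability
    p.add_argument("--teacher_forcing", type=float, default=0.5)  # teacher-forcing probability
    p.add_argument("--attention", action="store_true")          # enable attention mechanism
    p.add_argument("--mode", default="Normal",
                   choices=["Normal", "Test"])                  # Train+Val vs Train+Test
    return p

args = build_parser().parse_args(["--hidden_size", "512", "--attention"])
print(args.hidden_size, args.attention)
```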
- After satisfying the dependencies mentioned above, follow these steps to run the .ipynb file.
- In the main_1() function, located in the last cell of the .ipynb file, set the path to the aksharantar_sampled dataset folder; make sure this is the path to the unzipped folder, not to the zip file.
- In the same main_1() function, you can choose the language of your choice by passing the corresponding folder name to the Folder_name parameter.
- In main_1(), set the mode parameter to 'Normal' to use only the Train and Validation datasets, or to 'Test' to use the Train and Test datasets.
- After the steps above, run the .ipynb file sequentially from the first cell to the last, and you will see the results according to the sweep config.
- Comments are included in each cell to explain the code flow.
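The notebook steps above amount to one call in the last cell. The sketch below is a hypothetical stand-in for main_1(): the real body lives in the notebook, and the parameter name dataset_path and the language folder "hin" are assumptions for illustration (Folder_name and mode come from the steps above).

```python
# Hypothetical stand-in for the main_1() function in the last notebook cell.
def main_1(dataset_path, Folder_name, mode="Normal"):
    # mode='Normal' -> Train + Validation datasets; mode='Test' -> Train + Test datasets.
    assert mode in ("Normal", "Test"), "mode must be 'Normal' or 'Test'"
    split = "validation" if mode == "Normal" else "test"
    # ... in the notebook: load <dataset_path>/<Folder_name>/ train data plus the
    # chosen split, then train the seq2seq model per the sweep config ...
    return split

# Example: point at the unzipped aksharantar_sampled folder (not the zip file)
# and pick a language folder; "hin" here is a hypothetical folder name.
chosen = main_1("aksharantar_sampled", "hin", mode="Test")
print(chosen)
```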
- After satisfying the dependencies above, you can choose hyperparameter values and run the code by passing them on the command line.
- Each hyperparameter is described above, so you can modify the following command to run the train.py script.