This repository contains a from-scratch implementation of the Transformer model, built with TensorFlow. The model dimensions are the same as those used in the paper "Attention Is All You Need" (Vaswani et al.). The model is built for language translation: the encoder takes the input sentence and converts it into a matrix, which is then passed to the decoder; during training, the decoder also receives the target output (teacher forcing). The project consists of six files, all of which depend on each other. If you want to understand Transformers in more depth, step through the code in a debugger.
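To make the training-time data flow concrete, here is a minimal sketch of teacher forcing: the decoder is fed the target sequence shifted right by one position, and is trained to predict the next token at each step. The token ids and the `START`/`END` markers below are hypothetical, purely for illustration.

```python
# Hypothetical token ids illustrating teacher forcing:
# the decoder input is the target shifted right, the labels
# are the target shifted left, so each position predicts the next token.
START, END = 1, 2
target = [START, 5, 9, 7, END]   # ground-truth translation, as token ids
decoder_input = target[:-1]      # what the decoder sees: [START, 5, 9, 7]
decoder_labels = target[1:]      # what it must predict:  [5, 9, 7, END]
print(decoder_input, decoder_labels)
```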
- Self-attention mechanism
- Multi-head attention
- Positional encoding
- Encoder-decoder architecture
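The first three components above can be sketched in a few lines. The following is a minimal NumPy illustration (not the repository's TensorFlow code) of single-head scaled dot-product self-attention and sinusoidal positional encoding, following the formulas in the paper; the small `d_model = 8` is chosen only to keep the example readable (the paper uses 512).

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = Q.shape[-1]
    scores = Q @ K.transpose(0, 2, 1) / np.sqrt(d_k)
    # Numerically stable softmax over the last axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

def positional_encoding(seq_len, d_model):
    # PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
    # PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
    pos = np.arange(seq_len)[:, None]
    i = np.arange(d_model)[None, :]
    angles = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])
    pe[:, 1::2] = np.cos(angles[:, 1::2])
    return pe

# One batch, 4 tokens, toy d_model = 8 (the paper uses 512)
x = np.random.default_rng(0).normal(size=(1, 4, 8))
x = x + positional_encoding(4, 8)                 # inject position information
out, w = scaled_dot_product_attention(x, x, x)    # self-attention: Q = K = V
print(out.shape)  # (1, 4, 8)
```

Multi-head attention repeats this computation over several learned projections of Q, K, and V in parallel and concatenates the results.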
```shell
git clone https://github.com/Rohit2sali/TransformerFromScratch.git
cd TransformerFromScratch
```