Neural Nets

run the tests by doing a bundle exec rake

This repository aims to classify sentences by character frequency into languages based on passages in the bible.

The training set is all chapters of Matthew and Acts in Norwegian, English, Polish, Swedish, German, and Finnish.

There are two classes and a module in this repo and are:

Language - This takes a language training file, parses it and calculates an array of character frequencies built up using a hash.
Tokenizer - This is where the tokenization is accomplished for each text blob
Network - This interfaces with ruby-fann to train a neural network

To build a network built up on Language data one would do as follows

english = Language.new('English.txt', 'English')
network_of_english = Network.new([english])
network_of_english.train!

network_of_english.run('The quick brown fox') #=> english

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
lib		lib
public		public
script		script
test		test
views		views
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
Rakefile		Rakefile
app.rb		app.rb
config.ru		config.ru
swedish_chef_quotes.txt		swedish_chef_quotes.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Nets

About

Releases

Packages

Languages

hexgnu/language-predictor

Folders and files

Latest commit

History

Repository files navigation

Neural Nets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages