Skip to content

An implementation of frequent pattern mining using the Apriori algorithm to mine meaningful, representative phrases for domains of computer science from titles of academic papers.

Notifications You must be signed in to change notification settings

dillondavis/MiningMeaningfulPhrases

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MiningMeaningfulPhrases

An implementation of frequent pattern mining using the Apriori algorithm to mine meaningful, representative phrases for domains of computer science from titles of academic papers.

The topics are Information Retrieval, Databases, Data Mining, Machine Learning, and Theory. The corpus of paper titles is partitioned using Latent Dirichlet allocation from a natural language processing library.

Simply run 'python mine.py' in the src folder to run the entire program. Check out the results in 'results/pure_patterns' where phrases are ordered from most to least meaningful for each topic!

About

An implementation of frequent pattern mining using the Apriori algorithm to mine meaningful, representative phrases for domains of computer science from titles of academic papers.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published