Original notebook by Max Woolf. Updated for social media scraping by Shaunak G.
Retrain an advanced text-generating neural network on any text dataset for free on a GPU using Colaboratory and gpt-2-simple. Scrape text from Reddit, Twitter, or Tumblr using the associated libraries, then train the neural network on it.
For more about gpt-2-simple, you can visit this GitHub repository. You can also read the original creator's blog post for more information on how to use this notebook.
To get started:
- Copy this notebook to your Google Drive to keep it and save your changes. (File -> Save a Copy in Drive)
- Make sure you're running the notebook in Google Chrome.
- Run the cells below.
Note: some cells have been commented out so that the notebook runs end to end with Runtime -> Restart and Run All. You can mix and match the cells by deleting them or commenting them out yourself if you want to change functionality.
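As a rough orientation before running the cells, the scrape-then-train workflow described above can be sketched in a few lines. This is a minimal sketch, not the notebook's actual cells: the helper names `build_training_file` and `finetune_on`, the file name `posts.txt`, the step count, and the choice to wrap each post in `<|startoftext|>` / `<|endoftext|>` markers are all assumptions made for illustration. The gpt-2-simple calls are imported lazily so the data-preparation part runs without TensorFlow installed.

```python
from pathlib import Path

# Assumed delimiters marking where each scraped post begins and ends.
START, END = "<|startoftext|>", "<|endoftext|>"


def build_training_file(posts, path):
    """Write cleaned, de-duplicated posts to `path`, one delimited post per line.

    Returns the number of posts written. (Hypothetical helper.)
    """
    seen = set()
    lines = []
    for post in posts:
        # Collapse newlines and runs of whitespace into single spaces.
        text = " ".join(post.split())
        if not text or text in seen:
            continue  # skip empty posts and exact duplicates
        seen.add(text)
        lines.append(f"{START}{text}{END}")
    Path(path).write_text("".join(line + "\n" for line in lines), encoding="utf-8")
    return len(lines)


def finetune_on(path, model_name="124M", steps=500, run_name="run1"):
    """Fine-tune GPT-2 on the prepared file, then print generated samples.

    (Hypothetical wrapper; requires gpt-2-simple, TensorFlow, and ideally a GPU.)
    """
    import gpt_2_simple as gpt2  # lazy import: only needed when training

    gpt2.download_gpt2(model_name=model_name)  # fetch the base model weights
    sess = gpt2.start_tf_sess()
    gpt2.finetune(sess, dataset=path, model_name=model_name,
                  steps=steps, run_name=run_name)
    gpt2.generate(sess, run_name=run_name)
```

In Colab, this would be used by collecting the scraped posts into a list of strings, calling `build_training_file(scraped_posts, "posts.txt")`, and then `finetune_on("posts.txt")`. Keeping the data preparation separate from training makes it easy to inspect the text file before spending GPU time on it.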