Skip to content

Tungstic/node-meme-scraper

Repository files navigation

Node.js Meme Scraper

Task

Create a CLI application that scrapes the current version of this website:

Memegen Link

...and saves the first 10 images into a folder called "memes" within the directory of the new project. The image files should be named with a number with a leading zero, eg. 01.jpg, 02.jpg, etc. Make sure that the meme images are "ignored" in Git - they should not show up in your repository.

The program should be able to run multiple times without throwing an error.

Restrictions

Avoid using an "image scraper" or "image downloader" library that does multiple steps at once for you (eg. do not use image-downloader or nodejs-file-downloader or similar) - break the task down into smaller steps and select libraries as necessary for each step.

TODO

  • Command line cheatsheet steps (setup for Node.js project)
  • Set up a .gitignore with the memes folder
  • Create a folder called "memes"
  • Access the website (send a request to the website / "fetching")
  • Search for the section in the string of HTML that comes back (in the "response") from the website
  • Search for the first 10 image sources in the section
    • Two options
      • One option: .jpg - this is in the HTML code of the website
      • Another option: <img
    • Get the src URL strings from the img
  • Add the first 10 image URL strings to an array
  • Loop over the first 10 image URLs and:
    • Create a file (named correctly) in the memes folder (eg. 01.jpg, 02.jpg, etc)
    • Access the image URL
    • In this file, store the image data that comes back (in the "response") from the website
  • Test to make sure the program can run multiple times without failing

About

Node.js meme scraper for the first 10 images

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published