Skip to content

Scrape Google Scholar Profile into a custom HTML using Ruby (Nokogiri)

License

Notifications You must be signed in to change notification settings

raivivek/rubyscholar

 
 

Repository files navigation

Rubyscholar

Synopsis

Here is a small script to "scrape" your Google Scholar citations and reformat them (the way I need it for my website). Not super flexible - but should be easily customizable.

Some features:

  • if registered on Crossref, retreives corresponding DOIs and can add altmetric.org links. If Crossref doesn't think your email is valid, no DOIs will be retreived.
  • adds "Cited by N" for popular papers

How to use:

As a ruby script:

  1. Configure "config.yaml" If you want DOI retreival to work (including Altmetrics), you need to be registered at crossref (its free).
  2. Run bundle install then bundle exec rubyscholar. (Note if this fails, you first need to do gem install bundler). I've only tested with ruby 1.9.3
  3. A publications.html file is created with your publications from google scholar.
  4. Thats it.

Potential for improvement:

  • uses author list as visible on your main Google Scholar page. Sometimes this means names are chopped in two or just a single author is missing. This could be made smarter (by following the link to get the full author list).
  • output format could be more flexible. (e.g. change order (eg title before authors), or change formatting (e.g. remove first initial)). Perhaps this could be done with by providing a regexp search/replace configuration option within each field.
  • Ensure that a true email is entered.
  • right now only works from "user profile" pages. Not from "articles citing article" pages.
  • flexible use of DOIs

Technologies

Ruby, Nokogiri. Thanks to Google Scholar and Crossref. I hope none of this infringes on anything.

Contact

RubyScholar was developed by Yannick Wurm (http://yannick.poulet.org). Pull requests, patches and bug reports are welcome. The source code is available on github. Bug reports and feature requests may also be made there.

Copyright

RubyScholar © 2013 by Yannick Wurm. Licensed under the MIT license.

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

About

Scrape Google Scholar Profile into a custom HTML using Ruby (Nokogiri)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Ruby 100.0%