I just drank a ton of coffee and I’m blasting music in my headphones, so this post might be a bit more scatter-shot than most since I can’t really focus :]
Yesterday I managed to build a pretty naive scraper using Nokogiri which counts how often each word was used in the first 10 posts of this blog. Basically, it scrapes the home URL of the site and grabs everything matched by the div.entry-content selector.
Today I want to convert it into a more OO library so it’s a bit more modular and reusable. I also want to back everything with RSpec tests to get into the habit. While it won’t be true TDD, I’ll try to write the tests for the library before putting the classes together.
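As a starting point, the counting logic could live in a small class of its own, separate from the scraping, so it can be tested without hitting the network. This is only a sketch under my own assumptions: the class name WordCounter and its counts and top methods are hypothetical, not a finished design.

```ruby
# Hypothetical class name and interface, used here only for illustration.
class WordCounter
  def initialize(text)
    @text = text
  end

  # Returns a hash mapping each lowercased word to how often it appears.
  def counts
    @counts ||= @text.downcase.scan(/[a-z']+/).each_with_object(Hash.new(0)) do |word, tally|
      tally[word] += 1
    end
  end

  # Returns the n most frequent words as [word, count] pairs,
  # breaking ties alphabetically so the order is predictable.
  def top(n)
    counts.sort_by { |word, count| [-count, word] }.first(n)
  end
end

counter = WordCounter.new('Tests first, code second. Tests again.')
p counter.top(2) # prints [["tests", 2], ["again", 1]]
```

Because the class only takes a string, an RSpec example for it can stay tiny, along the lines of expect(WordCounter.new('a a b').counts).to eq('a' => 2, 'b' => 1).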