Posts tagged scraper

Hadoop — Feeding Reddit to Hadoop

hadoop-feeding-reddit-to-hadoop

With Hadoop installed on our lean mean Arch machine, we’re ready to fire up the first computations. Hadoop opens a world of fun with the promise of some heavy lifting and in order to feed the beast I’ve written a Reddit-scraper in just 30 lines of Clojure. More >

Enlive Vs Clojure-mode

enlive-vs-clojure-mode

Last night I published a screencast and in response to a comment I uploaded the htmlized source-code used in the screencast. The highlighting wasn’t working for me, so I put Emacs and Enlive to work!

More >

Clojure vs Ruby & Scala — Transient Newsgroups

clojure-vs-ruby-scala-transient-newsgroups

Recently I had the good pleasure of reading a blogpost which demonstrated a fun exercise in both Ruby and Scala, namely scraping newsgroups. I had a look at both solutions and decided to roll one in Clojure as well, examining the differences between the famous Ruby, the Juggernaut Scala and the elegant Clojure.


More >

MacSwing meets Enlive — Functional Social Webscraping!

macswing-meets-enlive-functional-social-webscraping

In this post I’ll show you how to make a beautiful Swing application with all the Mac-trimmings, a functional webscraper which gracefully overlooks malformed html and finally how to have some exploratory fun with Clojure, REPL style!

More >