This is a great talk with the head of Yahoo’s grid team that talks about the open source project Hadoop, which is an open source distributed file system and MapReduce implementation. The video is long and interspersed with Yahoo! specifics you might not care about – but keep watching, because they swing back to talking tech and about how you can write MapReduce programs in whatever language you want, and how you can do actual Hadoop programming using Python.
I’m excited about Hadoop, and I have tons of data to work with, but don’t currently have the cycles to devote to testing and playing with it. I hope to be able to at least get something up and running with it soon!