Spark and Shark tutorial/course given at Strata, materials online.
Nice tutorial on writing UDFs for Hive.
Tutorial on Machine Learning for Large Scale Recommender Systems from Deepak Agarwal and Bee-Chung Chen from Yahoo! Research.
I’m not sure how I haven’t seen this before, but it looks pretty amazing. It provides all the lecture slides and videos as well as student projects.
Check out the video playlist on Youtube
Tutorial on setting up Apache Accumulo, Nutch, and GORA for large scale web crawling.
Brief intro to using Apache Accumulo and Pig together.