Hadoop Streaming and F# MapReduce
And now for something completely different. As you may know, Microsoft has recently announced plans to adopt Hadoop on both Windows Server and Windows Azure. You can find out more about Hadoop and...
MapReduce Tester: A Quick Word
In my previous post I talked a little about testing the Hadoop Streaming F# MapReduce code, but it is worth saying a few words about the tester application. The complete code for this blog post and the...
Hadoop Binary Streaming and F# MapReduce
As mentioned in my previous post, Hadoop Streaming supports not only text streaming but also Binary Streaming. As such, I wanted to put together a sample that supports processing Office...
Hadoop Binary Streaming and PDF File Inclusion
In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office Word documents. However, due to their popularity, I thought inclusion for support of Adobe PDF documents...
Hadoop Streaming and Reporting
If, like me, you are a .Net developer and have written some Streaming jobs, it is not immediately obvious how one can do any reporting. However, if you dig through the Streaming Documentation you will come...
Hadoop Streaming and Windows Azure Blob Storage
One of the cool features of the Microsoft Distribution of Hadoop (MDH) is the native support for Windows Azure Blob Storage. When performing HDFS operations by default one can omit the scheme such...
Hadoop XML Streaming and F# MapReduce
So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming sample. As always the code can be found...
Hadoop Streaming in F# and MapReduce (summary)
With all my recent posts around Hadoop Streaming I thought it would be useful to summarize them into a single post. The main objective of these posts was to put together a codebase to enable F#...
Framework for Composing and Submitting .Net Hadoop MapReduce Jobs (Archived)
An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx If you have been following my blog...
.Net Hadoop MapReduce Job Framework - Revisited (Archived)
An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx If you have been using the...
Generics based Framework for .Net Hadoop MapReduce Job Submission
Over the past month I have been working on a framework to allow composition and submission of MapReduce jobs using .Net. I have put together two previous blog posts on this, so rather than put together...
Framework for .Net Hadoop MapReduce Job Submission libjars update
If you have been using the “Generics based Framework for .Net Hadoop MapReduce Job Submission” you may want to download the latest version of the code. The previous version of the code, when processing...
Framework for .Net Hadoop MapReduce Job Submission Binary Output
To end the week I decided to make a minor change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”. I have been doing some work on creating a co-occurrence matrix for item...
Framework for .Net Hadoop MapReduce Job Submission configuration update
To better support configuring the Stream environment whilst running .Net Streaming jobs I have made a change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code. I have fixed...
Framework for .Net Hadoop MapReduce Job Submission Json Serialization
A while back one of the changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code was to support Binary Serialization from Mapper, in and out of Combiners, and out...
MapReduce Based Co-occurrence Approach to an Item Based Recommender
In a previous post I covered the basics for a Co-occurrence Approach to an Item Based Recommender. As promised, here is the continuation of this work, an implementation of the same algorithm using...
C# MapReduce Based Co-occurrence Item Based Recommender
As promised, to conclude the Co-occurrence Approach to an Item Based Recommender posts I wanted to port the MapReduce code to C#; just for kicks and to prove the code is also easy to write in C#. For...
Framework for .Net Hadoop MapReduce Job Submission TextOutput Type
Some recent changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code were to support Json and Binary Serialization from Mapper, in and out of Combiners, and out from...
Framework for .Net Hadoop MapReduce Job Submission V1.0 Release
It has been a few months since I made a change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code. However, I was going to put together a sample for a Reduce side join...
Implementing a MapReduce Join with Hadoop and the .Net Framework
I have often been asked how one implements a Join whilst writing MapReduce code. As such, I thought it would be useful to add an additional sample demonstrating how this is achieved. There are...