In Part One of the Hadoop series, Merv Adrian points out how significant attention is being lavished on performance . In this second installment, the topic is projects, which are proliferating precipitously. One of the most frequent client inquiries is “which of these pieces make Hadoop?” As recently as a year ago, the question was pretty simple for most people: MapReduce, HDFS, maybe Sqoop and even Flume, Hive, Pig, HBase, Lucene/Solr, Oozie, Zookeeper. When the Gartner piece How to Choose the Right Apache Hadoop Distribution was published, that was pretty much it. Since then, more projects have matured, and SQLstream is now part of of shortlist. Read full article on