随着全球数据呈爆发式增长,基于海量数据的挖掘和分析,为用户带来了巨大的商业价值。源于开源平台的Apache Hadoop,允许使用简单的编程模型跨计算机集群分布式处理大型数据集,成为大数据时代最受欢迎的技术之一。 HDFS分布式文件系统作为Hadoop的三大组件 ...
| 本文转载自“腾讯游戏存储与计算技术”,作者:周晓,腾讯游戏 CROS 体系高级工程师,负责多套HDFS集群的维护管理,并为Apache Hadoop社区提交过2个Patch。同时也作为游戏DBA,稳定支撑包括穿越火线、天天酷跑等在内多款游戏的DB管理维护。 这次我吐血整理了 ...
What are some of the cool things in the 2.0 release of Hadoop? To start, how about a revamped MapReduce? And what would you think of a high availability (HA) implementation of the Hadoop Distributed ...
It would be pure understatement to say that the world has changed since Hadoop debuted just over a decade ago. Rewind the tape to 5 - 10 years ago, and if you wanted to work with big data, Hadoop was ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Apache Hadoop has been the driving force behind the growth of the big data industry. You'll hear it mentioned often, along with associated technologies such as Hive and Pig. But what does it do, and ...
In the battle for big data storage supremacy, Hadoop is still in the running. It may no longer be the 800-lb gorilla, but the demonstrated scalability of the Hadoop Distributed File System (HDFS) ...
This article is part of the Five Essential Strategies for Successful HPC Clusters series which was written to help managers, administrators, and users deploy and operate successful HPC clusters, ...
As a poster child for big data, Hadoop is continually brought out as the reference architecture for big data analytics. But what exactly is Hadoop and what are the key points of Hadoop storage ...
A peek under the hood of Yahoo's Hadoop deployment illustrates how vast the ecosystem has become -- and how the company that invented it is still leading the way We’ve heard a lot lately about how ...