Learning PySpark (learning-pyspark.pdf)
It is estimated that in 2013 the whole world produced around 4.4 zettabytes of data;
that is, 4.4 billion terabytes! By 2020, we (as the human race) are expected to produce
ten times that. With data getting larger literally by the second, and given the growing
appetite for making sense out of it, in 2004 Google employees Jeffrey Dean and
Sanjay Ghemawat published the seminal paper MapReduce: Simplified Data Processing
on Large Clusters. Since then, technologies leveraging the concept have grown
very quickly, with Apache Hadoop initially being the most popular. Hadoop
ultimately spawned an ecosystem that included abstraction layers such as Pig,
Hive, and Mahout, all built on this simple concept of map and reduce.
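To make the idea concrete, here is a minimal word-count sketch of the map-and-reduce pattern in PySpark (the subject of this book). It is only an illustration under stated assumptions: a local PySpark installation, and an application name and sample input lines that are purely made up.

```python
# Minimal sketch of the map-and-reduce idea using PySpark.
# Assumes a local PySpark installation; input lines are illustrative.
from pyspark import SparkContext

sc = SparkContext("local[*]", "wordcount-sketch")

lines = sc.parallelize([
    "simplified data processing on large clusters",
    "map and reduce on large clusters",
])

counts = (
    lines.flatMap(lambda line: line.split())  # map phase: split each line into words
         .map(lambda word: (word, 1))         # emit a (word, 1) pair per occurrence
         .reduceByKey(lambda a, b: a + b)     # reduce phase: sum the counts per word
)

print(counts.collect())
sc.stop()
```

The same two-step shape, mapping records to key-value pairs and then reducing values that share a key, is what Pig, Hive, and Mahout expose through their higher-level abstractions.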