

















Big Data: Principles and Best Practices of Scalable Realtime Data Systems

Web-scale applications like social networks, real-time analytics, or e-commerce sites deal with a lot of data, whose volume and velocity exceed the limits of traditional database systems. These applications require architectures built around clusters of machines to store and process data of any size or speed. Fortunately, scale and simplicity are not mutually exclusive. Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web-scale data. This book presents the Lambda Architecture, a scalable, easy-to-understand approach that can be built and run by a small team. You'll explore the theory of big data systems and how to implement them in practice. In addition to discovering a general framework for processing big data, you'll learn specific technologies like Hadoop, Storm, and NoSQL databases. This book requires no previous exposure to large-scale data analysis or NoSQL tools. Familiarity with traditional databases is helpful.


Learning to Love Data Science

Until recently, many people thought big data was a passing fad. “Data science” was an enigmatic term. Today, big data is taken seriously, and data science is considered downright sexy. With this anthology of reports from award-winning journalist Mike Barlow, you’ll appreciate how data science is fundamentally altering our world, for better and for worse. Barlow paints a picture of the emerging data space in broad strokes. From new techniques and tools to the use of data for social good, you’ll find out how far data science reaches.


Programming Hive

Hive makes life much easier for developers who work with stored and managed data in Hadoop clusters, such as data warehouses. With this example-driven guide, you'll learn how to use the Hive infrastructure to provide data summarization, query, and analysis, particularly with HiveQL, Hive's dialect of SQL. You'll learn how to set up Hive in your environment and optimize its use, and how it interoperates with other tools, such as HBase. You'll also learn how to extend Hive with custom code written in Java or scripting languages. Ideal for developers with prior SQL experience, this book shows you how Hive simplifies many tasks that would be much harder to implement in the lower-level MapReduce API provided by Hadoop.


Getting Started with Storm

Even as data is turning the world upside down, the next revolution is taking shape: realtime data analysis. Data tools like Hadoop are batch-oriented; they're good at taking a set of data, analyzing it, and giving you the result later. But these tools are not particularly good at handling streaming data: data that is constantly arriving, and results that are changing as fast as the data arrives. Storm is a new tool that solves that problem. This book gets you started with it.


HBase in Action

When dealing with big data, traditional databases don't cut it. It's best to use the right tool for the job. HBase is a NoSQL storage system designed from the ground up for fast, random access to large volumes of data. Built on Hadoop, it runs on commodity hardware and scales from modest datasets up to millions of columns and billions of rows. HBase in Action provides all the knowledge needed to design, build, and run applications using HBase. First, it introduces the fundamentals of distributed systems and large-scale data handling. Then, it explores real-world applications and code samples with just enough theory to explain practical techniques. It shows readers how to build applications with HBase and take advantage of the MapReduce processing framework as they learn patterns and best practices.

About the technology: HBase is an open-source, column-oriented database management system written in Java. It runs on top of the Hadoop Distributed File System (HDFS), is fast enough to directly power websites, and integrates tightly with MapReduce, a programming model and framework for processing large data sets. HBase has become the data backbone for production deployments worldwide in companies like StumbleUpon, Facebook, and Twitter.


Pro LINQ: Language Integrated Query in C# 2008

LINQ is the project name for a set of extensions to the .NET Framework that provide a generic approach to querying data from different data sources. LINQ will premiere in Visual Studio 2008 and will become the next must-have skill for .NET developers. Pro LINQ: Language Integrated Query in C# 2008 is all about code. Literally, this book starts with code and ends with code. In most books, the author shows the simplest example demonstrating how to use a method but rarely shows how to use the more complex prototypes. Pro LINQ: Language Integrated Query in C# 2008 is different. Demonstrating the overwhelming majority of LINQ operators and prototypes, it is a veritable treasury of LINQ examples. Rather than obscure the relevant LINQ principles in code examples by focusing on a demonstration application you have no interest in writing, Pro LINQ: Language Integrated Query in C# 2008 cuts right to the chase of each LINQ operator, method, or class. However, where complexity is necessary to truly demonstrate an issue, the examples are right there in the thick of it. For example, code samples demonstrating how to handle concurrency conflicts actually create concurrency conflicts so you can step through the code and see them unfold. Most books tell you about the simple stuff, while few books warn you of the pitfalls. Where Pro LINQ: Language Integrated Query in C# 2008 returns your investment is in the hours, and sometimes days, spent by the author determining why something may not work as expected. Sometimes this results in an innocent-looking paragraph that may take you a minute to read and understand but took days to research and explain. Face it: most technical books, while informative, are dull. LINQ need not be dull. Written with a sense of humor, Pro LINQ: Language Integrated Query in C# 2008 will attempt to entertain you on your journey through the wonderland of LINQ and C# 2008.
What you'll learn:
  • How to leverage all the new LINQ-relevant C# 2008 language features, including extension methods, lambda expressions, anonymous data types, and partial methods.
  • How to use LINQ to Objects to query in-memory data collections such as arrays, ArrayLists, and Lists to retrieve the data you want.
  • Why some queries are deferred, how a deferred query can bite you, and how you can make deferred queries work for you.
  • How to use LINQ to XML to revolutionize your creation, manipulation, and searching of XML data.
  • How to query DataSets with LINQ to DataSet so you can coexist with legacy code and use LINQ to query databases other than SQL Server.
  • How to query databases with LINQ to SQL, write your own entity classes, and understand how to handle concurrency conflicts.


Refactoring: Improving the Design of Existing Code

Refactoring: Improving the Design of Existing Code (English edition). A software engineering classic and a must-read.


