- 博客(0)
- 资源 (10)
- 收藏
- 关注
Cloud Native Devops with Kubernetes
Building, Deploying, and Scaling Modern Applications in the Cloud
John Arundel and Justin Domingus
2019-04-08
Natural Language Processing Fundamentals
Build intelligent applications that can interpret the human language to deliver impactful results.
Sohom Ghosh and Dwight Gunning
2019-04-08
Architects of Intelligence
How can AI evolve to human levels? What could its impact on society and the economy be? What new AI innovations are on the horizon?
23 experts talk to world-leading AI authority Martin Ford on all of this and more. Find out what the future holds for AI in Architects of Intelligence, one of the Financial Times Books of the Year 2018.
2019-03-07
Practical Data Science with Hadoop and Spark
Designing and Building Effective Analytics at Scale
Ofer Mendelevitch
Casey Stella
Douglas Eadline
2019-02-06
FLASK_BUILDING_PYTHON_WEB_SERVICES
Unleash the full potential of the Flask web framework by creating small to large and powerful web applications.
2019-01-30
Big Data Analytics
Big Data Analytics - A handy reference guide for data analysts and data
scientists to help to obtain value from big data analytics
using Spark on Hadoop clusters
2019-01-30
Advanced Analytics with Spark- Second Edition
Spark: The Definitive Guide - 2nd Edition
by Bill Chambers and Matei Zaharia
2019-01-30
Elasticsearch for Hadoop
Table of Contents
Elasticsearch for Hadoop
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Why subscribe?
Free access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Setting Up Environment
Setting up Hadoop for Elasticsearch
Setting up Java
Setting up a dedicated user
Installing SSH and setting up the certificate
Downloading Hadoop
Setting up environment variables
Configuring Hadoop
Configuring core-site.xml
Configuring hdfs-site.xml
Configuring yarn-site.xml
Configuring mapred-site.xml
The format distributed filesystem
Starting Hadoop daemons
Setting up Elasticsearch
Downloading Elasticsearch
Configuring Elasticsearch
Installing Elasticsearch's Head plugin
Installing the Marvel plugin
Running and testing
Running the WordCount example
Getting the examples and building the job JAR file
Importing the test file to HDFS
Running our first job
Exploring data in Head and Marvel
Viewing data in Head
Using the Marvel dashboard
Exploring the data in Sense
Summary
2. Getting Started with ES-Hadoop
Understanding the WordCount program
Understanding Mapper
Understanding the reducer
Understanding the driver
Using the old API – org.apache.hadoop.mapred
Going real — network monitoring data
Getting and understanding the data
Knowing the problems
Solution approaches
Approach 1 – Preaggregate the results
Approach 2 – Aggregate the results at query-time
Writing the NetworkLogsMapper job
Writing the mapper class
Writing Driver
Building the job
Getting the data into HDFS
Running the job
Viewing the Top N results
Getting data from Elasticsearch to HDFS
Understanding the Twitter dataset
Trying it yourself
Creating the MapReduce job to import data from Elasticsearch to HDFS
Writing the Tweets2Hdfs mapper
Running the example
Testing the job execution output
Summary ...
2018-11-27
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人