Yuriy's bookshelf: dev-data en-US Tue, 16 Nov 2021 06:28:43 -0800 60 Yuriy's bookshelf: dev-data 144 41 /images/layout/goodreads_logo_144.jpg <![CDATA[Instant MapReduce Patterns � Hadoop Essentials How-to]]> 19032105 60 Srinath Perera Yuriy 0 dev-data, just-interesting 3.60 2013 Instant MapReduce Patterns – Hadoop Essentials How-to
author: Srinath Perera
name: Yuriy
average rating: 3.60
book published: 2013
rating: 0
read at:
date added: 2021/11/16
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Statistics, Data Mining, and Machine Learning in Astronomy: A Practical Python Guide for the Analysis of Survey Data (Princeton Series in Modern Observational Astronomy, 1)]]> 17861507
"Statistics, Data Mining, and Machine Learning in Astronomy" presents a wealth of practical analysis problems, evaluates techniques for solving them, and explains how to use various approaches for different types and sizes of data sets. For all applications described in the book, Python code and example data sets are provided. The supporting data sets have been carefully selected from contemporary astronomical surveys (for example, the Sloan Digital Sky Survey) and are easy to download and use. The accompanying Python code is publicly available, well documented, and follows uniform coding standards. Together, the data sets and code enable readers to reproduce all the figures and examples, evaluate the methods, and adapt them to their own fields of interest. Describes the most useful statistical and data-mining methods for extracting knowledge from huge and complex astronomical data sets Features real-world data sets from contemporary astronomical surveys Uses a freely available Python codebase throughout Ideal for students and working astronomers]]>
560 Željko Ivezić 0691151687 Yuriy 0 dev-data, just-interesting 4.00 2013 Statistics, Data Mining, and Machine Learning in Astronomy: A Practical Python Guide for the Analysis of Survey Data (Princeton Series in Modern Observational Astronomy, 1)
author: Željko Ivezić
name: Yuriy
average rating: 4.00
book published: 2013
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Programming Elastic MapReduce: Using AWS Services to Build an End-to-End Application]]> 19471016

Although you don’t need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS).

Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application. Using code samples and example configurations, you’ll learn how to assemble the building blocks necessary to solve your biggest data analysis problems.

Get an overview of the AWS and Apache software tools used in large-scale data analysis Go through the process of executing a Job Flow with a simple log analyzer Discover useful MapReduce patterns for filtering and analyzing data sets Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow Learn the basics for using Amazon EMR to run machine learning algorithms Develop a project cost model for using Amazon EMR and other AWS tools ]]>
174 Kevin Schmidt 1449364020 Yuriy 0 dev-data, just-interesting 3.00 2013 Programming Elastic MapReduce: Using AWS Services to Build an End-to-End Application
author: Kevin Schmidt
name: Yuriy
average rating: 3.00
book published: 2013
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Programming Elastic Mapreduce: Using Aws Services to Build an End-To-End Application]]> 17346945
Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application. Using code samples and example configurations, you’ll learn how to assemble the building blocks necessary to solve your biggest data analysis problems.


Get an overview of the AWS and Apache software tools used in large-scale data analysis
Go through the process of executing a Job Flow with a simple log analyzer
Discover useful MapReduce patterns for filtering and analyzing data sets
Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow
Learn the basics for using Amazon EMR to run machine learning algorithms
Develop a project cost model for using Amazon EMR and other AWS tools]]>
174 Kevin Schmidt 1449363628 Yuriy 0 dev-data, just-interesting 3.29 2013 Programming Elastic Mapreduce: Using Aws Services to Build an End-To-End Application
author: Kevin Schmidt
name: Yuriy
average rating: 3.29
book published: 2013
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems]]> 14514285
--Tom White, author of The Definitive Guide]]>
247 Donald Miner 1449327176 Yuriy 0 dev-data, just-interesting 3.90 2012 MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems
author: Donald Miner
name: Yuriy
average rating: 3.90
book published: 2012
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Massively Parallel Databases and Mapreduce Systems (Foundations and Trends(r) in Databases)]]> 18960568 120 Shivnath Babu 1601987501 Yuriy 0 dev-data, just-interesting 5.00 2013 Massively Parallel Databases and Mapreduce Systems (Foundations and Trends(r) in Databases)
author: Shivnath Babu
name: Yuriy
average rating: 5.00
book published: 2013
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Amazon Elastic MapReduce (Amazon EMR) Developer Guide]]> 18670785 937 Amazon Web Services Yuriy 0 dev-data, just-interesting 3.53 2012 Amazon Elastic MapReduce (Amazon EMR) Developer Guide
author: Amazon Web Services
name: Yuriy
average rating: 3.53
book published: 2012
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
Hadoop: The Definitive Guide 6308439 528 Tom White 0596521979 Yuriy 0 dev-data, just-interesting 4.00 2009 Hadoop: The Definitive Guide
author: Tom White
name: Yuriy
average rating: 4.00
book published: 2009
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
<![CDATA[Data Mining: Practical Machine Learning Tools and Techniques]]> 213031
Download Link :


ÌýÌýÌýÌýÌýÌý

ÌýÌýÌýÌýÌýÌý


0128042915 Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Series in Data Management Systems) PDF by Ian H. Witten
Read Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Series in Data Management Systems) PDF from Morgan Kaufmann,Ian H. Witten
Download Ian H. Witten's PDF E-book Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann Series in Data Management Systems)]]>
560 Ian H. Witten 0120884070 Yuriy 0 dev-data, just-interesting 3.92 1999 Data Mining: Practical Machine Learning Tools and Techniques
author: Ian H. Witten
name: Yuriy
average rating: 3.92
book published: 1999
rating: 0
read at:
date added: 2018/07/15
shelves: dev-data, just-interesting
review:

]]>
Mining of Massive Datasets 12818088 326 Jure Leskovec 1107015359 Yuriy 5 4.35 2011 Mining of Massive Datasets
author: Jure Leskovec
name: Yuriy
average rating: 4.35
book published: 2011
rating: 5
read at: 2014/03/06
date added: 2016/09/19
shelves: favorite, read_on_english, dev-data
review:

]]>