Free Ebooks at your fingertips

All Free Ebooks

Fast Data Processing with Spark – Second Edition

Perform real-time analytics using Spark in a fast, distributed, and scalable way.

About This Book: Develop a machine learning system with Spark's MLlib and scalable algorithms. Deploy Spark jobs to various clusters such as Mesos, EC2, Chef, YARN, EMR, and so on. This is a step-by-step tutorial that unleashes the power of Spark and its latest features.

Who This Book Is For: Fast Data Processing with Spark - Second Edition is for software developers who want to learn how to write distributed programs with Spark. It will help developers who have had problems that were too big to be dealt with on a single computer. No previous experience with distributed programming is necessary. This book assumes knowledge of either Java, Scala, or Python.

What You Will Learn: Install and set up Spark on your cluster. Prototype distributed applications with Spark's interactive shell. Learn different ways to interact with Spark's distributed representation of data (RDDs). Query Spark with a SQL-like query syntax. Effectively test your distributed software. Recognize how Spark works with big data. Implement machine learning systems with highly scalable algorithms.

In Detail: Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach[...]

The Truth Machine: The Blockchain and the Future of Everything

"Views differ on bitcoin, but few doubt the transformative potential of Blockchain technology. The Truth Machine is the best book so far on what has happened and what may come along. It demands the attention of anyone concerned with our economic future." - Lawrence H. Summers, Charles W. Eliot University Professor and President Emeritus at Harvard, Former Treasury Secretary

From Michael J. Casey and Paul Vigna, the authors of The Age of Cryptocurrency, comes the definitive work on the Internet’s Next Big Thing: The Blockchain.

Big banks have grown bigger and more entrenched. Privacy exists only until the next hack. Credit card fraud is a fact of life. Many of the “legacy systems” once designed to make our lives easier and our economy more efficient are no longer up to the task. Yet there is a way past all this: a new kind of operating system with the potential to revolutionize vast swaths of our economy, the blockchain. In The Truth Machine, Michael J. Casey and Paul Vigna demystify the blockchain and explain why it can restore personal control over our data, assets, and identities; grant billions of excluded people access to the global economy; and shift the balance of power to revive[...]

Data Manipulation with R – Second Edition

Efficiently perform data manipulation using the split-apply-combine strategy in R.

About This Book: Perform data manipulation with add-on packages such as plyr, reshape, stringr, lubridate, and sqldf. Learn about factor manipulation, string processing, and text manipulation techniques using the stringr and dplyr libraries. Enhance your analytical skills in an intuitive way through step-by-step working examples.

Who This Book Is For: This book is for all those who wish to learn about data manipulation from scratch and excel at aggregating data effectively. It is expected that you have basic knowledge of R and have previously done some basic administration work with R.

What You Will Learn: Learn about R data types and their basic operations. Work efficiently with string, factor, and date variables using stringr. Understand group-wise data manipulation. Work with different layouts of R datasets and interchange between layouts for varied purposes. Manage bigger datasets using plyr and dplyr. Perform data manipulation with add-on packages such as plyr, reshape, stringr, lubridate, and sqldf. Manipulate datasets using SQL statements with the sqldf package. Clean and structure raw data for data mining using text manipulation.

In Detail: This book starts with the installation of R and how to go about using R and its libraries. We then discuss the[...]

Life After Google: The Fall of Big Data and the Rise of the Blockchain Economy

A FINANCIAL TIMES BOOK OF THE MONTH

FROM THE WALL STREET JOURNAL: "Nothing Mr. Gilder says or writes is ever delivered at anything less than the fullest philosophical decibel... Mr. Gilder sounds less like a tech guru than a poet, and his words tumble out in a romantic cascade."

“Google’s algorithms assume the world’s future is nothing more than the next moment in a random process. George Gilder shows how deep this assumption goes, what motivates people to make it, and why it’s wrong: the future depends on human action.” - Peter Thiel, founder of PayPal and Palantir Technologies and author of Zero to One: Notes on Startups, or How to Build the Future

The Age of Google, built on big data and machine intelligence, has been an awesome era. But it’s coming to an end. In Life after Google, George Gilder, the peerless visionary of technology and culture, explains why Silicon Valley is suffering a nervous breakdown and what to expect as the post-Google age dawns. Google’s astonishing ability to “search and sort” attracts the entire world to its search engine and countless other goodies: videos, maps, email, calendars. And everything it offers is free, or so it seems. Instead of paying directly[...]

R: Recipes for Analysis, Visualization and Machine Learning

Get savvy with R language and actualize projects aimed at analysis, visualization and machine learning.

About This Book: Proficiently analyze data and apply machine learning techniques. Generate visualizations, develop interactive visualizations and applications to understand various data exploratory functions in R. Construct a predictive model by using a variety of machine learning packages.

Who This Book Is For: This Learning Path is ideal for those who have been exposed to R, but have not used it extensively yet. It covers the basics of using R and is written for new and intermediate R users interested in learning. This Learning Path also provides in-depth insights into professional techniques for analysis, visualization, and machine learning with R – it will help you increase your R expertise, regardless of your level of experience.

What You Will Learn: Get data into your R environment and prepare it for analysis. Perform exploratory data analyses and generate meaningful visualizations of the data. Generate various plots in R using the basic R plotting techniques. Create presentations and learn the basics of creating apps in R for your audience. Create and inspect the transaction dataset, performing association analysis with the Apriori algorithm. Visualize associations in various graph formats and find frequent itemset using the ECLAT algorithm. Build, tune, and evaluate predictive models[...]

Big Data Analytics with Spark: A Practitioner’s Guide to Using Spark for Large Scale Data Analysis

Big Data Analytics with Spark is a step-by-step guide for learning Spark, which is an open-source, fast, and general-purpose cluster computing framework for large-scale data analysis. You will learn how to use Spark for different types of big data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. In addition, this book will help you become a much sought-after Spark expert.

Spark is one of the hottest Big Data technologies. The amount of data generated today by devices, applications and users is exploding. Therefore, there is a critical need for tools that can analyze large-scale data and unlock value from it. Spark is a powerful technology that meets that need. You can, for example, use Spark to perform low-latency computations through the use of efficient caching and iterative algorithms; leverage the features of its shell for easy and interactive data analysis; employ its fast batch processing and low-latency features to process your real-time data streams; and so on. As a result, adoption of Spark is growing rapidly, and Spark is replacing Hadoop MapReduce as the technology of choice for big data analytics. This book provides an introduction to Spark and related big-data technologies[...]

Next Generation Databases: NoSQL and Big Data

"It’s not easy to find such a generous book on big data and databases. Fortunately, this book is the one." Feng Yu. Computing Reviews. June 28, 2016.

This is a book for enterprise architects, database administrators, and developers who need to understand the latest developments in database technologies. It is the book to help you choose the correct database technology at a time when concepts such as Big Data, NoSQL and NewSQL are making what used to be an easy choice into a complex decision with significant implications.

The relational database (RDBMS) model completely dominated database technology for over 20 years. Today this "one size fits all" stability has been disrupted by a relatively recent explosion of new database technologies. These paradigm-busting technologies are powering the "Big Data" and "NoSQL" revolutions, as well as forcing fundamental changes in databases across the board.

Deciding to use a relational database was once truly a no-brainer, and the various commercial relational databases competed on price, performance, reliability, and ease of use rather than on fundamental architectures. Today we are faced with choices between radically different database technologies. Choosing the right database today is a complex undertaking, with serious economic and technological consequences.

Next Generation Databases demystifies[...]

Real-World Hadoop

If you’re a business team leader, CIO, business analyst, or developer interested in how Apache Hadoop and Apache HBase-related technologies can address problems involving large-scale data in cost-effective ways, this book is for you. Using real-world stories and situations, authors Ted Dunning and Ellen Friedman show Hadoop newcomers and seasoned users alike how NoSQL databases and Hadoop can solve a variety of business and research issues.

You’ll learn about early decisions and pre-planning that can make the process easier and more productive. If you’re already using these technologies, you’ll discover ways to gain the full range of benefits possible with Hadoop. While you don’t need a deep technical background to get started, this book does provide expert guidance to help managers, architects, and practitioners succeed with their Hadoop projects.

Examine a day in the life of big data: India’s ambitious Aadhaar project. Review tools in the Hadoop ecosystem such as Apache’s Spark, Storm, and Drill to learn how they can help you. Pick up a collection of technical and strategic tips that have helped others succeed with Hadoop. Learn from several prototypical Hadoop use cases, based on how organizations have actually applied the technology. Explore real-world stories that reveal how MapR customers combine use cases when[...]
