Free Ebooks at your fingertips

All Free Ebooks

Hadoop in Practice: Includes 104 Techniques

SummaryHadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.About the BookIt's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date[...]

Head First Data Analysis: A learner’s guide to big numbers, statistics, and good decisions

Today, interpreting data is a critical decision-making factor for businesses and organizations. If your job requires you to manage and analyze all kinds of data, turn to Head First Data Analysis, where you'll quickly learn how to collect and organize data, sort the distractions from the truth, find meaningful patterns, draw conclusions, predict the future, and present your findings to others.Whether you're a product developer researching the market viability of a new product or service, a marketing manager gauging or predicting the effectiveness of a campaign, a salesperson who needs data to support product presentations, or a lone entrepreneur responsible for all of these data-intensive functions and more, the unique approach in Head First Data Analysis is by far the most efficient way to learn what you need to know to convert raw data into a vital business tool.You'll learn how to:Determine which data sources to use for collecting informationAssess data quality and distinguish signal from noiseBuild basic data models to illuminate patterns, and assimilate new information into the modelsCope with ambiguous informationDesign experiments to test hypotheses and draw conclusionsUse segmentation to organize your data within discrete market groupsVisualize data distributions to reveal new relationships and persuade othersPredict the future[...]

Mining the Social Web: Data Mining Facebook, Twitter, Linkedin, Google+, Github, And More

How can you tap into the wealth of social web data to discover who’s making connections with whom, what they’re talking about, and where they’re located? With this expanded and thoroughly revised edition, you’ll learn how to acquire, analyze, and summarize data from all corners of the social web, including Facebook, Twitter, LinkedIn, Google+, GitHub, email, websites, and blogs.Employ the Natural Language Toolkit, NetworkX, and other scientific computing tools to mine popular social web sitesApply advanced text-mining techniques, such as clustering and TF-IDF, to extract meaning from human language dataBootstrap interest graphs from GitHub by discovering affinities among people, programming languages, and coding projectsBuild interactive visualizations with D3.js, an extraordinarily flexible HTML5 and JavaScript toolkitTake advantage of more than two-dozen Twitter recipes, presented in O’Reilly’s popular "problem/solution/discussion" cookbook formatThe example code for this unique data science book is maintained in a public GitHub repository. It’s designed to be easily accessible through a turnkey virtual machine that facilitates interactive learning with an easy-to-use collection of IPython Notebooks[...]

Beginning Oracle Database 11g Administration: From Novice to Professional (Expert’s Voice in Oracle)

This book, written by veteran Oracle database administrator Iggy Fernandez, a regular on the Oracle conference circuit and the editor of NoCOUG Journal, is a manageable introduction to key Oracle database administration topics including planning, installation, monitoring, troubleshooting, maintenance, and backups, to name just a few. As is clear from the table of contents, this book is not simply a recitation of Oracle Database features such as what you find in the reference guides available for free download on the Oracle web site. For example, the chapter on database monitoring explains how to monitor database availability, database changes, database security, database backups, database growth, database workload, database performance, and database capacity. The chapters of this book are logically organized into four parts that closely track the way your database administration career will naturally evolve. Part 1 gives you necessary background in relational database theory and Oracle Database concepts, Part 2 teaches you how to implement an Oracle database correctly, Part 3 exposes you to the daily routine of a database administrator, and Part 4 introduces you to the fine art of performance tuning. Each chapter has exercises designed to help you apply the lessons of the chapter. Each chapter also[...]

Oracle Database 11g DBA Handbook (Oracle Press)

Publisher's Note: Products purchased from Third Party sellers are not guaranteed by the publisher for quality, authenticity, or access to any online entitlements included with the product.The Essential Resource for Oracle DBAs--Fully Updated and ExpandedManage a flexible, highly available Oracle database with help from the expert information contained in this exclusive Oracle Press guide. Fully revised to cover every new feature and utility, Oracle Database 11g DBA Handbook shows how to perform a new installation, upgrade from previous versions, configure hardware and software for maximum efficiency, and employ bulletproof security. You will learn to automate the backup and recovery process, provide transparent failover capability, audit and tune performance, and distribute your enterprise databases with Oracle Net.Plan and deploy permanent, temporary, and bigfile tablespacesOptimize disk allocation, CPU usage, I/O throughput, and SQL queriesDevelop powerful database management applicationsGuard against human errors using Oracle Flashback and Oracle Automatic Undo ManagementDiagnose and tune system performance using Oracle Automatic Workload Repository and SQL Tuning SetsImplement robust security using authentication, authorization, fine-grained auditing, and fine-grained access controlMaintain high availability using Oracle Real Application Clusters and Oracle Active Data GuardRespond more efficiently to failure scenarios by leveraging the Oracle Automatic Diagnostic Repository and the Oracle Repair AdvisorBack[...]

The Logic of Life: The Rational Economics of an Irrational World

Life sometimes seems illogical. Individuals do strange things: take drugs, have unprotected sex, mug each other. Love seems irrational, and so does divorce. On a larger scale, life seems no fairer or easier to fathom: Why do some neighborhoods thrive and others become ghettos? Why is racism so persistent? Why is your idiot boss paid a fortune for sitting behind a mahogany altar? Thorny questions–and you might be surprised to hear the answers coming from an economist. But award-winning journalist Tim Harford likes to spring surprises. In this deftly reasoned book, he argues that life is logical after all. Under the surface of everyday insanity, hidden incentives are at work, and Harford shows these incentives emerging in the most unlikely places[...]

Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL

There's a wealth of data online, but sorting and gathering it by hand can be tedious and time consuming. Rather than click through page after endless page, why not let bots do the work for you?Webbots, Spiders, and Screen Scrapers will show you how to create simple programs with PHP/CURL to mine, parse, and archive online data to help you make informed decisions. Michael Schrenk, a highly regarded webbot developer, teaches you how to develop fault-tolerant designs, how best to launch and schedule the work of your bots, and how to create Internet agents that:Send email or SMS notifications to alert you to new information quicklySearch different data sources and combine the results on one page, making the data easier to interpret and analyzeAutomate purchases, auction bids, and other online activities to save timeSample projects for automating tasks like price monitoring and news aggregation will show you how to put the concepts you learn into practice.This second edition of Webbots, Spiders, and Screen Scrapers includes tricks for dealing with sites that are resistant to crawling and scraping, writing stealthy webbots that mimic human search behavior, and using regular expressions to harvest specific data. As you discover the possibilities of web[...]

Data Warehousing in the Age of Big Data (The Morgan Kaufmann Series on Business Intelligence)

Data Warehousing in the Age of the Big Data will help you and your organization make the most of unstructured data with your existing data warehouse. As Big Data continues to revolutionize how we use data, it doesn't have to create more confusion. Expert author Krish Krishnan helps you make sense of how Big Data fits into the world of data warehousing in clear and concise detail. The book is presented in three distinct parts. Part 1 discusses Big Data, its technologies and use cases from early adopters. Part 2 addresses data warehousing, its shortcomings, and new architecture options, workloads, and integration techniques for Big Data and the data warehouse. Part 3 deals with data governance, data visualization, information life-cycle management, data scientists, and implementing a Big Data–ready data warehouse. Extensive appendixes include case studies from vendor implementations and a special segment on how we can build a healthcare information factory. Ultimately, this book will help you navigate through the complex layers of Big Data and data warehousing while providing you information on how to effectively think about using all these technologies and the architectures to design the next-generation data warehouse.Learn how to leverage Big Data by effectively integrating it[...]

DeFi Central

Blockchain FREE Books for Download

No post found

Finance FREE Books for Download

No post found

Soft Skills Books

No post found

Posts Categories

Consent Preferences