FreeComputerBooks.com
Links to Free Computer, Mathematics, Technical Books all over the World
- Title: The Data Engineering Cookbook: Mastering The Plumbing Of Data Science
- Author(s): Andreas Kretz
- Publisher: The Data Engineering Academy; eBook (Apache Licensed)
- License(s): Apache License
- Hardcover/Paperback: N/A
- eBook: PDF
- Language: English
- ISBN-10/ASIN: N/A
- ISBN-13: N/A
This book provides a clear understanding of data modeling techniques and pipelining. It begins with the basics of data engineering, then covers the technologies and frameworks required to build data pipelines that work with large datasets.
This is a practical and comprehensive guide. It covers the topics that surround data engineering: storage, models, structures, access patterns, encoding, replication, partitioning, distributed systems, batch and stream processing, and the future of data systems.
By reading this book, you will gain a clear understanding of real-world big data architecture. It is a good fit if you are working in, or interviewing for, big data engineering, and it introduces the fundamental concepts behind the much-hyped Big Data tools.
You will also learn how to transform and clean data and perform analytics to get the most out of your data. At the end of the book, you will learn how to work with big data of varying complexity and with production databases, and how to build data pipelines. You will also build architectures on which you will learn to deploy data pipelines using real-world examples.
About the Author: N/A
- Data Science and Data Engineering
- Data Analysis and Data Mining
- Big Data
- Machine Learning
- Unix/Linux Shell Scripting
- The Data Engineering Cookbook: Mastering The Plumbing Of Data Science (Andreas Kretz)
- The Mirror Site (1) - PDF
- The Mirror Site (2) - PDF
- Book Homepage (HTML and PDF Edition, Source Code, etc.)
- The Evolving Role of the Data Engineer (Andy Oram)
If you're pursuing a career in data engineering or looking for ways to adapt your enterprise to the world of big data, this report shares the knowledge you need to find your way forward.
- Data Science at the Command Line, 2nd Ed. (Jeroen Janssens)
This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. Learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.
- 97 Things Every Data Engineer Should Know (Tobias Macey)
With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Experts share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges.
- The Ultimate Guide to Effective Data Cleaning
- The Data Engineer's Guide to Apache Spark (Databricks)
This book is for data engineers looking to leverage the immense growth of Apache Spark to build faster and more reliable data pipelines. It leverages Spark's amazing speed, scalability, simplicity, and versatility to build practical Big Data solutions.
- Building the Data Lakehouse (Bill Inmon, et al.)
Learn about the features and architecture of the data lakehouse, along with its powerful analytical infrastructure. The data lakehouse is the next generation of the data warehouse and data lake.
- 97 Things Every Cloud Engineer Should Know: from the Experts
With this book, professionals from around the world provide valuable insight into today's cloud engineering role. It explores the entire cloud computing experience, including fundamentals, architecture, and migration.