Processing ......
FreeComputerBooks.com
Links to Free Computer, Mathematics, Technical Books all over the World
 
The Data Engineering Cookbook: Mastering The Plumbing Of Data Science
🌠 Top Free Machine Learning Books - 100% Free or Open Source!
  • Title: The Data Engineering Cookbook: Mastering The Plumbing Of Data Science
  • Author(s) Andreas Kretz
  • Publisher: The Data Engineering Academy; eBook (Apache Licensed)
  • License(s): Apache License
  • Hardcover/Paperback: N/A
  • eBook: PDF
  • Language: English
  • ISBN-10/ASIN: N/A
  • ISBN-13: N/A
  • Share This:  

Book Description

This book provides a clear understanding of data modeling techniques and pipelining. At the beginning of the book, you will learn the basics of data engineering. Then you will learn the technologies and frameworks required to build data pipelines to work with large datasets.

This is a practical and comprehensive guide. This book deals with all the stuff that happens around data engineering like storage, models, structures, access patterns, encoding, replication, partitioning, distributed systems, batch & stream processing, and the future of data systems.

By reading this book, you get a clear understanding of real-world big data architecture. This book is good for you if you are working on or interviewing for big data engineering. This book provides an amazing introduction to the fundamental concepts behind the much-hyped Big Data tools

You will also learn how to transform and clean data and perform analytics to get the most out of your data. At the last of the book, you will learn how to work with big data of varying complexity and production databases, and build data pipelines. You will also build architectures on which you’ll learn how to deploy data pipelines using real-world examples.

About the Author
  • N/A
Reviews, Ratings, and Recommendations: Related Book Categories: Read and Download Links: Similar Books:
  • The Evolving Role of the Data Engineer (Andy Oram)

    If you're pursuing a career in data engineering or looking for ways to adapt your enterprise to the world of big data, this report shares the knowledge you need to find your way forward.

  • Data Science at the Command Line, 2nd Ed. (Jeroen Janssens)

    This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. Learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.

  • 97 Things Every Data Engineer Should Know (Tobias Macey)

    With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Experts share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges.

  • The Ultimate Guide to Effective Data Cleaning

    With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Experts share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges.

  • The Data Engineer's Guide to Apache Spark (Databricks)

    This book is for data engineers looking to leverage the immense growth of Apache Spark to build faster and more reliable data pipelines. It leverages Spark's amazing speed, scalability, simplicity, and versatility to build practical Big Data solutions.

  • Building the Data Lakehouse (Bill Inmon, et al.)

    Learn about the features and architecture of the data lakehouse, along with its powerful analytical infrastructure. The data lakehouse is the next generation of the data warehouse and data lake.

  • 97 Things Every Cloud Engineer Should Know: from the Experts

    With this book, professionals from around the world provide valuable insight into today's cloud engineering role. It explore the entire cloud computing experience, including fundamentals, architecture, and migration.

Book Categories
:
Other Categories
Resources and Links