- Table View
- List View
97 Things Every Data Engineer Should Know
by Tobias MaceyTake advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often nagging challenges.Edited by Tobias Macey, host of the popular Data Engineering Podcast, this book presents 97 concise and useful tips for cleaning, prepping, wrangling, storing, processing, and ingesting data. Data engineers, data architects, data team managers, data scientists, machine learning engineers, and software engineers will greatly benefit from the wisdom and experience of their peers.Topics include:The Importance of Data Lineage - Julien Le DemData Security for Data Engineers - Katharine JarmulThe Two Types of Data Engineering and Data Engineers - Jesse AndersonSix Dimensions for Picking an Analytical Data Warehouse - Gleb MezhanskiyThe End of ETL as We Know It - Paul SingmanBuilding a Career as a Data Engineer - Vijay KiranModern Metadata for the Modern Data Stack - Prukalpa SankarYour Data Tests Failed! Now What? - Sam Bail
Building Machine Learning Pipelines: Automating Model Life Cycles With Tensorflow
by Hannes Hapke Catherine NelsonCompanies are spending billions on machine learning projects, but it’s money wasted if the models can’t be deployed effectively. In this practical guide, Hannes Hapke and Catherine Nelson walk you through the steps of automating a machine learning pipeline using the TensorFlow ecosystem. You’ll learn the techniques and tools that will cut deployment time from days to minutes, so that you can focus on developing new models rather than maintaining legacy systems.Data scientists, machine learning engineers, and DevOps engineers will discover how to go beyond model development to successfully productize their data science projects, while managers will better understand the role they play in helping to accelerate these projects.Understand the steps to build a machine learning pipelineBuild your pipeline using components from TensorFlow ExtendedOrchestrate your machine learning pipeline with Apache Beam, Apache Airflow, and Kubeflow PipelinesWork with data using TensorFlow Data Validation and TensorFlow TransformAnalyze a model in detail using TensorFlow Model AnalysisExamine fairness and bias in your model performanceDeploy models with TensorFlow Serving or TensorFlow Lite for mobile devicesLearn privacy-preserving machine learning techniques
97 Things Every SRE Should Know: Collective Wisdom From The Experts
by Emil Stolarsky Jaime WooSite reliability engineering (SRE) is more relevant than ever. Knowing how to keep systems reliable has become a critical skill. With this practical book, newcomers and old hats alike will explore a broad range of conversations happening in SRE. You'll get actionable advice on several topics, including how to adopt SRE, why SLOs matter, when you need to upgrade your incident response, and how monitoring and observability differ.Editors Jaime Woo and Emil Stolarsky, co-founders of Incident Labs, have collected 97 concise and useful tips from across the industry, including trusted best practices and new approaches to knotty problems. You'll grow and refine your SRE skills through sound advice and thought-provokingquestions that drive the direction of the field.Some of the 97 things you should know:"Test Your Disaster Plan"--Tanya Reilly"Integrating Empathy into SRE Tools"--Daniella Niyonkuru"The Best Advice I Can Give to Teams"--Nicole Forsgren"Where to SRE"--Fatema Boxwala"Facing That First Page"--Andrew Louis"I Have an Error Budget, Now What?"--Alex Hidalgo"Get Your Work Recognized: Write a Brag Document"--Julia Evans and Karla Burnett
Building Serverless Applications with Google Cloud Run
by Wietse VenemaLearn how to build a real-world serverless application in the cloud that's reliable, secure, maintainable, and scalable. If you have experience building web applications on traditional infrastructure, this hands-on guide shows you how to get started with Cloud Run, a container-based serverless product on Google Cloud.Through the course of this book, you'll learn how to deploy several example applications that highlight different parts of the serverless stack on Google Cloud. Combining practical examples with fundamentals, this book will appeal to developers who are early in their learning journey as well as experienced practitioners.Build a serverless application with Google Cloud RunLearn approaches for building containers with (and without) DockerExplore Google Cloud's managed relational database: Cloud SQLUse HTTP sessions to make every user's experience uniqueExplore identity and access management (IAM) on Cloud RunProvision Google Cloud resources using TerraformLearn how to handle background task scheduling on Cloud RunMove your service from Cloud Run to Knative Serving with little effort
Think Bayes: Bayesian Statistics In Python
by Allen B. DowneyIf you know how to program, you're ready to tackle Bayesian statistics. With this book, you'll learn how to solve statistical problems with Python code instead of mathematical formulas, using discrete probability distributions rather than continuous mathematics. Once you get the math out of the way, the Bayesian fundamentals will become clearer and you'll begin to apply these techniques to real-world problems.Bayesian statistical methods are becoming more common and more important, but there aren't many resources available to help beginners. Based on undergraduate classes taught by author Allen B. Downey, this book's computational approach helps you get a solid start.Use your programming skills to learn and understand Bayesian statisticsWork with problems involving estimation, prediction, decision analysis, evidence, and Bayesian hypothesis testingGet started with simple examples, using coins, dice, and a bowl of cookiesLearn computational methods for solving real-world problems
Practical Natural Language Processing: A Comprehensive Guide to Building Real-World NLP Systems
by Sowmya Vajjala Bodhisattwa Majumder Anuj Gupta Harshit SuranaMany books and courses tackle natural language processing (NLP) problems with toy use cases and well-defined datasets. But if you want to build, iterate, and scale NLP systems in a business setting and tailor them for particular industry verticals, this is your guide. Software engineers and data scientists will learn how to navigate the maze of options available at each step of the journey.Through the course of the book, authors Sowmya Vajjala, Bodhisattwa Majumder, Anuj Gupta, and Harshit Surana will guide you through the process of building real-world NLP solutions embedded in larger product setups. You’ll learn how to adapt your solutions for different industry verticals such as healthcare, social media, and retail.With this book, you’ll:Understand the wide spectrum of problem statements, tasks, and solution approaches within NLPImplement and evaluate different NLP applications using machine learning and deep learning methodsFine-tune your NLP solution based on your business problem and industry verticalEvaluate various algorithms and approaches for NLP product tasks, datasets, and stagesProduce software solutions following best practices around release, deployment, and DevOps for NLP systemsUnderstand best practices, opportunities, and the roadmap for NLP from a business and product leader’s perspective
Design Patterns for Cloud Native Applications
by Kasun Indrasiri Sriskandarajah SuhothayanWith the immense cost savings and scalability the cloud provides, the rationale for building cloud native applications is no longer in question. The real issue is how. With this practical guide, developers will learn about the most commonly used design patterns for building cloud native applications using APIs, data, events, and streams in both greenfield and brownfield development.You'll learn how to incrementally design, develop, and deploy large and effective cloud native applications that you can manage and maintain at scale with minimal cost, time, and effort. Authors Kasun Indrasiri and Sriskandarajah Suhothayan highlight use cases that effectively demonstrate the challenges you might encounter at each step.Learn the fundamentals of cloud native applicationsExplore key cloud native communication, connectivity, and composition patternsLearn decentralized data management techniquesUse event-driven architecture to build distributed and scalable cloud native applicationsExplore the most commonly used patterns for API management and consumptionExamine some of the tools and technologies you'll need for building cloud native systems
CockroachDB: Distributed Data at Scale
by Guy Harrison Jesse Seldess Ben DarnellGet the lowdown on CockroachDB, the elastic SQL database built to handle the demands of today's data-driven world. With this practical guide, software developers, architects, and DevOps teams will discover the advantages of building on a distributed SQL database. You'll learn how to create applications that scale elastically and provide seamless delivery for end users while remaining exceptionally resilient and indestructible.Written from scratch for the cloud and architected to scale elastically to handle the demands of cloud native and open source, CockroachDB makes it easier to build and scale modern applications. If you're familiar with distributed systems, you'll quickly discover the benefits of strong data correctness and consistency guarantees as well as optimizations for delivering ultralow latencies to globally distributed end users.With this thorough guide, you'll learn how to:Plan and build applications for distributed infrastructure, including data modeling and schema designMigrate data into CockroachDBRead and write data and run ACID transactions across distributed infrastructureOptimize queries for performance across geographically distributed replicasPlan a CockroachDB deployment for resiliency across single-region and multiregion clustersSecure, monitor, and optimize your CockroachDB deployment
Making Futures Work: Integrating Futures Thinking for Design, Innovation, and Strategy
by Phil BalagtasLearn how to get started with Futures Thinking. With this practical guide, Phil Balagtas, founder of the Design Futures Initiative and the global Speculative Futures network, shows you how designers and futurists have made futures work at companies such as Atari, IBM, Apple, Disney, Autodesk, Lufthansa, and McKinsey & Company.This book demystifies the process of Futures Thinking into a language that's practical and useful for both designers and strategists. You'll learn about Strategic Foresight for using ideas about the future to anticipate and prepare for change; explore Speculative Design to deal with the relationship between science, technology, and humans; and Design Fiction to explore and critique possible futures.Balagtas also shares stories from his journey to build a global community and describes how he works with clients to reshape the futures vocabulary. With this guide, you'll learn how to:Prepare your client, team, and/or audience for futuresFacilitate and work with the fundamental methods and frameworksGain advocacy and support within your organizationProvide measurable value from the process and outcomesBuild a futures culture and teamSustain a culture and support system beyond projects
Cloud Native Infrastructure with Azure: Building and Managing Cloud Native Applications
by Nishant Singh Michael KehoeThe cloud is becoming the de facto home for companies ranging from enterprises to startups. Moving to the cloud means moving your applications from monolith to microservices. But once you do, running and maintaining these services brings its own level of complexity. The answer? Modularity, deployability, observability, and self-healing capacity through cloud native development.With this practical book, Nishant Singh and Michael Kehoe show you how to build a true cloud native infrastructure using Microsoft Azure or another cloud computing solution by following guidelines from the Cloud Native Computing Foundation (CNCF). DevOps and site reliability engineers will learn how adapting applications to cloud native early in the design phase helps you fully utilize the elasticity and distributed nature of the cloud.This book helps you explore:Why go cloud native?How to use infrastructure as codeWhat it takes to containerize an applicationWhy and how Kubernetes is the "grand orchestrator"How to create a Kubernetes cluster on AzureHow observability complements monitoringHow to use service discovery and a service mesh to find new territoriesHow networking and policy management serve as gatekeepersHow distributed databases and storage work
SQL Cookbook
by Anthony Molinaro Robert De GraafYou may know SQL basics, but are you taking advantage of its expressive power? This second edition applies a highly practical approach to Structured Query Language (SQL) so you can create and manipulate large stores of data. Based on real-world examples, this updated cookbook provides a framework to help you construct solutions and executable examples in several flavors of SQL, including Oracle, DB2, SQL Server, MySQL, and PostgreSQL.SQL programmers, analysts, data scientists, database administrators, and even relatively casual SQL users will find SQL Cookbook to be a valuable problem-solving guide for everyday issues. No other resource offers recipes in this unique format to help you tackle nagging day-to-day conundrums with SQL.The second edition includes:Fully revised recipes that recognize the greater adoption of window functions in SQL implementationsAdditional recipes that reflect the widespread adoption of common table expressions (CTEs) for more readable, easier-to-implement solutionsNew recipes to make SQL more useful for people who aren't database experts, including data scientistsExpanded solutions for working with numbers and stringsUp-to-date SQL recipes throughout the book to guide you through the basics
Practical Fairness: Achieving Fair And Secure Data Models
by Aileen NielsenFairness is an increasingly important topic as machine learning and AI more generally take over the world. While this is an active area of research, many realistic best practices are emerging at all steps along the data pipeline, from data selection and preprocessing to blackbox model audits. This book will guide you through the technical, legal, and ethical aspects of making your code fair and secure while highlighting cutting edge academic research and ongoing legal developments related to fairness and algorithms.There is mounting evidence that the widespread deployment of machine learning and artificial intelligence in business and government is reproducing the same biases we are trying to fight in the real world. For this reason, fairness is an increasingly important consideration for the data scientist. Yet discussions of what fairness means in terms of actual code are few and far between. This code will show you how to code fairly as well as cover basic concerns related to data security and privacy from a fairness perspective.
JavaScript: Master the World's Most-Used Programming Language (Definitive Guide Ser.)
by David FlanaganFor web developers and other programmers interested in using JavaScript, this bestselling book provides the most comprehensive JavaScript material on the market. The seventh edition represents a significant update, with new information for ECMAScript 2020, and new chapters on language-specific features.JavaScript: The Definitive Guide is ideal for experienced programmers who want to learn the programming language of the web, and for current JavaScript programmers who want to master it.
Zero to Genetic Engineering Hero: The beginner's guide to programming bacteria at home, school, & in the makerspace
by Justin Pahara Julie LegaultZero to Genetic Engineering Hero is made to provide you with a first glimpse of the inner-workings of a cell. It further focuses on skill-building for genetic engineering and the Biology-as-a-Technology mindset (BAAT). This book is designed and written for hands-on learners who have little knowledge of biology or genetic engineering. This book focuses on the reader mastering the necessary skills of genetic engineering while learning about cells and how they function. The goal of this book is to take you from no prior biology and genetic engineering knowledge toward a basic understanding of how a cell functions, and how they are engineered, all while building the skills needed to do so.
Programming PHP: Creating Dynamic Web Pages (Apresspod Ser.)
by Kevin Tatroe Peter MacIntyreWhy is PHP the most widely used programming language on the web? This updated edition teaches everything you need to know to create effective web applications using the latest features in PHP 7.4. You’ll start with the big picture and then dive into language syntax, programming techniques, and other details, using examples that illustrate both correct usage and common idioms. If you have a working knowledge of HTML, authors Kevin Tatroe and Peter MacIntyre provide many style tips and practical programming advice in a clear and concise manner to help you become a top-notch PHP programmer.Understand what’s possible when you use PHP programsLearn language fundamentals, including data types, variables, operators, and flow control statementsExplore functions, strings, arrays, and objectsApply common web application techniques, such as form processing, data validation, session tracking, and cookiesInteract with relational databases like MySQL or NoSQL databases such as MongoDBGenerate dynamic images, create PDF files, and parse XML filesLearn secure scripts, error handling, performance tuning, and other advanced topicsGet a quick reference to PHP core functions and standard extensions
Kubernetes Best Practices: Blueprints for Building Successful Applications on Kubernetes
by Brendan Burns Eddie Villalba Dave Strebel Lachlan EvensonIn this practical guide, four Kubernetes professionals with deep experience in distributed systems, enterprise application development, and open source will guide you through the process of building applications with this container orchestration system. They distill decades of experience from companies that are successfully running Kubernetes in production and provide concrete code examples to back the methods presented in this book.Revised to cover all the latest Kubernetes features, new tooling, and deprecations, this book is ideal for those who are familiar with basic Kubernetes concepts but want to get up to speed on the latest best practices. You'll learn exactly what you need to know to build your best app with Kubernetes the first time.Set up and develop applications in KubernetesLearn patterns for monitoring, securing your systems, and managing upgrades, rollouts, and rollbacksIntegrate services and legacy applications and develop higher-level platforms on top of KubernetesRun machine learning workloads in KubernetesEnsure pod and container securityUnderstand issues that have become increasingly critical to the successful implementation of Kubernetes, such as chaos engineering/testing, GitOps, service mesh, and observability
AI and Machine Learning for On-Device Development: A Programmer's Guide
by Laurence MoroneyAI is nothing without somewhere to run it. Now that mobile devices have become the primary computing device for most people, it's essential that mobile developers add AI to their toolbox. This insightful book is your guide to creating and running models on popular mobile platforms such as iOS and Android.Laurence Moroney, lead AI advocate at Google, offers an introduction to machine learning techniques and tools, then walks you through writing Android and iOS apps powered by common ML models like computer vision and text recognition, using tools such as ML Kit, TensorFlow Lite, and Core ML. If you're a mobile developer, this book will help you take advantage of the ML revolution today.Explore the options for implementing ML and AI on mobile devicesCreate ML models for iOS and AndroidWrite ML Kit and TensorFlow Lite apps for iOS and Android, and Core ML/Create ML apps for iOSChoose the best techniques and tools for your use case, such as cloud-based versus on-device inference and high-level versus low-level APIsLearn privacy and ethics best practices for ML on devices
Efficient Linux at the Command Line: Boost Your Command-Line Skills
by Daniel J. BarrettTake your Linux skills to the next level! Whether you're a system administrator, software developer, site reliability engineer, or enthusiastic hobbyist, this practical, hands-on book will help you work faster, smarter, and more efficiently. You'll learn how to create and run complex commands that solve real business problems, process and retrieve information, and automate manual tasks.You'll also truly understand what happens behind the shell prompt, so no matter which commands you run, you can be more successful in everyday Linux use and more competitive on the job market.As you build intermediate to advanced command-line skills, you'll learn how to:Choose or construct commands that get your work done quicklyRun commands efficiently and navigate the Linux filesystem with easeBuild powerful, complex commands out of simpler onesTransform text files and query them like databases to achieve business goalsControl Linux point-and-click features from the command line
Operating OpenShift: An SRE Approach to Managing Infrastructure
by Rick Rackow Manuel DewaldKubernetes has gained significant popularity over the past few years, with OpenShift as one of its most mature and prominent distributions. But while OpenShift provides several layers of abstraction over vanilla Kubernetes, this software can quickly become overwhelming because of itsrich feature set and functionality. This practical book helps you understand and manage OpenShift clusters from minimal deployment to large multicluster installations.Principal site reliability engineers Rick Rackow and Manuel Dewald, who worked together on Red Hat's managed OpenShift offering for years, provide valuable advice to help your teams operate OpenShift clusters efficiently. Designed for SREs, system administrators, DevOps engineers, andcloud architects, Operating OpenShift encourages consistent and easy container orchestration and helps reduce the effort of deploying a Kubernetes platform. You'll learn why OpenShift has become highly attractive to enterprises large and small.Learn OpenShift core concepts and deployment strategiesExplore multicluster OpenShift Container Platform deploymentsAdminister OpenShift clusters following best practicesLearn best practices for deploying workloads to OpenShiftMonitor OpenShift clusters through state-of-the-art conceptsBuild and deploy Kubernetes operators to automate administrative tasksConfigure OpenShift clusters using a GitOps approach
Data Quality Fundamentals: A Practitioner's Guide to Building Trustworthy Data Pipelines
by Barr Moses Lior Gavish Molly VorwerckDo your product dashboards look funky? Are your quarterly reports stale? Is the data set you're using broken or just plain wrong? These problems affect almost every team, yet they're usually addressed on an ad hoc basis and in a reactive manner. If you answered yes to these questions, this book is for you. Many data engineering teams today face the "good pipelines, bad data" problem. It doesn't matter how advanced your data infrastructure is if the data you're piping is bad. In this book, Barr Moses, Lior Gavish, and Molly Vorwerck, from the data observability company Monte Carlo, explain how to tackle data quality and trust at scale by leveraging best practices and technologies used by some of the world's most innovative companies. Build more trustworthy and reliable data pipelinesWrite scripts to make data checks and identify broken pipelines with data observabilityLearn how to set and maintain data SLAs, SLIs, and SLOsDevelop and lead data quality initiatives at your companyLearn how to treat data services and systems with the diligence of production softwareAutomate data lineage graphs across your data ecosystemBuild anomaly detectors for your critical data assets
Deploying to OpenShift: A Guide for Busy Developers
by Graham DumpletonGet an in-depth tour of OpenShift, the container-based software deployment and management platform from Red Hat that provides a secure multi-tenant environment for the enterprise. This practical guide describes in detail how OpenShift, building on Kubernetes, enables you to automate the way you create, ship, and run applications in a containerized environment.Author Graham Dumpleton provides the knowledge you need to make the best use of the OpenShift container platform to deploy not only your cloud-native applications, but also more traditional stateful applications. Developers and administrators will learn how to run, access, and manage containers in OpenShift, including how to orchestrate them at scale.Build application container images from source and deploy themImplement and extend application image buildersUse incremental and chained builds to accelerate build timesAutomate builds by using a webhook to link OpenShift to a Git repositoryAdd configuration and secrets to the container as project resourcesMake an application visible outside the OpenShift clusterManage persistent storage inside an OpenShift containerMonitor application health and manage the application lifecycleThis book is a perfect follow-up to OpenShift for Developers: A Guide for Impatient Beginners (O’Reilly).
Prompt Engineering for LLMs: The Art and Science of Building Large Language Model–Based Applications
by John Berryman Albert ZieglerLarge language models (LLMs) are revolutionizing the world, promising to automate tasks and solve complex problems. A new generation of software applications are using these models as building blocks to unlock new potential in almost every domain, but reliably accessing these capabilities requires new skills. This book will teach you the art and science of prompt engineering-the key to unlocking the true potential of LLMs. Industry experts John Berryman and Albert Ziegler share how to communicate effectively with AI, transforming your ideas into a language model-friendly format. By learning both the philosophical foundation and practical techniques, you'll be equipped with the knowledge and confidence to build the next generation of LLM-powered applications.Understand LLM architecture and learn how to best interact with itDesign a complete prompt-crafting strategy for an applicationGather, triage, and present context elements to make an efficient promptMaster specific prompt-crafting techniques like few-shot learning, chain-of-thought prompting, and RAG
Fusion 360 for Makers: Design Your Own Digital Models for 3D Printing and CNC Fabrication
by Lydia Sloan ClineLearn how to use Autodesk Fusion 360 to digitally model your own original projects for a 3D printer or a CNC device.
iOS 15 Programming Fundamentals with Swift: Swift, Xcode, and Cocoa Basics
by Matt NeuburgMove into iOS development by getting a firm grasp of its fundamentals, including the Xcode 13 IDE, Cocoa Touch, and the latest version of Apple's acclaimed programming language, Swift 5.5. With this thoroughly updated guide, you'll learn the Swift language, understand Apple's Xcode development tools, and discover the Cocoa framework.Explore Swift's object-oriented conceptsBecome familiar with built-in Swift typesDive deep into Swift objects, protocols, and genericsTour the life cycle of an Xcode projectLearn how nibs are loadedUnderstand Cocoa's event-driven designCommunicate with C and Objective-CIn this edition, catch up on the latest iOS programming features:Structured concurrency: async/await, tasks, and actorsSwift native formatters and attributed stringsLazy locals and throwing gettersEnhanced collections with the Swift Algorithms and Collections packagesXcode tweaks: column breakpoints, package collections, and Info.plist build settingsImprovements in Git integration, localization, unit testing, documentation, and distributionAnd more!
Design for How People Think: Using Brain Science to Build Better Products
by John WhalenUser experience doesn’t happen on a screen; it happens in the mind, and the experience is multidimensional and multisensory. This practical book will help you uncover critical insights about how your customers think so you can create products or services with an exceptional experience.Corporate leaders, marketers, product owners, and designers will learn how cognitive processes from different brain regions form what we perceive as a singular experience. Author John Whalen shows you how anyone on your team can conduct "contextual interviews" to unlock insights. You’ll then learn how to apply that knowledge to design brilliant experiences for your customers.Learn about the "six minds" of user experience and how each contributes to the perception of a singular experienceFind out how your team—without any specialized training in psychology—can uncover critical insights about your customers’ conscious and unconscious processesLearn how to immediately apply what you’ve learned to improve your products and servicesExplore practical examples of how the Fortune 100 used this system to build highly successful experiences