Browse Results

Showing 25,076 through 25,100 of 100,000 results

Data Science: Konzepte, Erfahrungen, Fallstudien und Praxis

by Detlev Frick Andreas Gadatsch Jens Kaufmann Birgit Lankes Christoph Quix Andreas Schmidt Uwe Schmitz

Data Science ist in vielen Organisationen angekommen und oft alltägliche Praxis. Dennoch stehen viele Verantwortliche vor der Herausforderung, sich erstmalig mit konkreten Fragestellungen zu beschäftigen oder laufende Projekte weiterzuentwickeln. Die Spannbreite der Methoden, Werkzeuge und Anwendungsmöglichkeiten ist sehr groß und entwickelt sich kontinuierlich weiter. Die Vielzahl an Publikationen zu Data Science ist spezialisiert und behandelt fokussiert Einzelaspekte. Das vorliegende Werk gibt den Leserinnen und Lesern eine umfassende Orientierung zum Status Quo aus der wissenschaftlichen Perspektive und zahlreiche vertiefende Darstellungen praxisrelevanter Aspekte. Die Inhalte bauen auf den wissenschaftlichen CAS-Zertifikatskursen zu Big Data und Data Science der Hochschule Niederrhein in Kooperation mit der Hochschule Bonn-Rhein-Sieg und der FH Dortmund auf. Sie berücksichtigen wissenschaftliche Grundlagen und Vertiefungen, aber auch konkrete Erfahrungen aus Data Science Projekten. Das Buch greift praxisrelevante Fragen auf wissenschaftlichem Niveau aus Sicht der Rollen eines „Data Strategist“, „Data Architect“ und „Data Analyst“ auf und bindet erprobte Praxiserfahrungen u. a. von Seminarteilnehmern mit ein. Das Buch gibt für Interessierte einen Einblick in die aktuell relevante Vielfalt der Aspekte zu Data Science bzw. Big Data und liefert Hinweise für die praxisnahe Umsetzung.

Data Science: Create Teams That Ask the Right Questions and Deliver Real Value

by Doug Rose

Learn how to build a data science team within your organization rather than hiring from the outside. Teach your team to ask the right questions to gain actionable insights into your business. Most organizations still focus on objectives and deliverables. Instead, a data science team is exploratory. They use the scientific method to ask interesting questions and run small experiments. Your team needs to see if the data illuminate their questions. Then, they have to use critical thinking techniques to justify their insights and reasoning. They should pivot their efforts to keep their insights aligned with business value. Finally, your team needs to deliver these insights as a compelling story. Insight!: How to Build Data Science Teams that Deliver Real Business Value shows that the most important thing you can do now is help your team think about data. Management coach Doug Rose walks you through the process of creating and managing effective data science teams. You will learn how to find the right people inside your organization and equip them with the right mindset. The book has three overarching concepts: You should mine your own company for talent. You can't change your organization by hiring a few data science superheroes. You should form small, agile-like data teams that focus on delivering valuable insights early and often. You can make real changes to your organization by telling compelling data stories. These stories are the best way to communicate your insights about your customers, challenges, and industry. What Your Will Learn: Create data science teams from existing talent in your organization to cost-efficiently extract maximum business value from your organization's data Understand key data science terms and concepts Follow practical guidance to create and integrate an effective data science team with key roles and the responsibilities for each team member Utilize the data science life cycle (DSLC) to model essential processes and practices for delivering value Use sprints and storytelling to help your team stay on track and adapt to new knowledge Who This Book Is For Data science project managers and team leaders. The secondary readership is data scientists, DBAs, analysts, senior management, HR managers, and performance specialists.

Data Science: A First Introduction (Chapman & Hall/CRC Data Science Series)

by Tiffany Timbers Trevor Campbell Melissa Lee

Data Science: A First Introduction focuses on using the R programming language in Jupyter notebooks to perform data manipulation and cleaning, create effective visualizations, and extract insights from data using classification, regression, clustering, and inference. The text emphasizes workflows that are clear, reproducible, and shareable, and includes coverage of the basics of version control. All source code is available online, demonstrating the use of good reproducible project workflows. Based on educational research and active learning principles, the book uses a modern approach to R and includes accompanying autograded Jupyter worksheets for interactive, self-directed learning. The book will leave readers well-prepared for data science projects. The book is designed for learners from all disciplines with minimal prior knowledge of mathematics and programming. The authors have honed the material through years of experience teaching thousands of undergraduates in the University of British Columbia’s DSCI100: Introduction to Data Science course.

Data Science, AI, and Machine Learning in Drug Development (Chapman & Hall/CRC Biostatistics Series)

by Harry Yang

The confluence of big data, artificial intelligence (AI), and machine learning (ML) has led to a paradigm shift in how innovative medicines are developed and healthcare delivered. To fully capitalize on these technological advances, it is essential to systematically harness data from diverse sources and leverage digital technologies and advanced analytics to enable data-driven decisions. Data science stands at a unique moment of opportunity to lead such a transformative change. Intended to be a single source of information, Data Science, AI, and Machine Learning in Drug Research and Development covers a wide range of topics on the changing landscape of drug R & D, emerging applications of big data, AI and ML in drug development, and the build of robust data science organizations to drive biopharmaceutical digital transformations. Features Provides a comprehensive review of challenges and opportunities as related to the applications of big data, AI, and ML in the entire spectrum of drug R & D Discusses regulatory developments in leveraging big data and advanced analytics in drug review and approval Offers a balanced approach to data science organization build Presents real-world examples of AI-powered solutions to a host of issues in the lifecycle of drug development Affords sufficient context for each problem and provides a detailed description of solutions suitable for practitioners with limited data science expertise

Data Science and Analytics: 4th International Conference On Recent Developments In Science, Engineering And Technology, Redset 2017, Gurgaon, India, October 13-14, 2017, Revised Selected Papers (Communications In Computer And Information Science #799)

by Brajendra Panda Sudeep Sharma Nihar Ranjan Roy

This book constitutes the refereed proceedings of the 4th International Conference on Recent Developments in Science, Engineering and Technology, REDSET 2017, held in Gurgaon, India, in October 2017. The 66 revised full papers presented were carefully reviewed and selected from 329 submissions. The papers are organized in topical sections on big data analysis, data centric programming, next generation computing, social and web analytics, security in data science analytics.

Data Science and Analytics for SMEs: Consulting, Tools, Practical Use Cases

by Afolabi Ibukun Tolulope

Master the tricks and techniques of business analytics consulting, specifically applicable to small-to-medium businesses (SMEs). Written to help you hone your business analytics skills, this book applies data science techniques to help solve problems and improve upon many aspects of a business' operations. SMEs are looking for ways to use data science and analytics, and this need is becoming increasingly pressing with the ongoing digital revolution. The topics covered in the books will help to provide the knowledge leverage needed for implementing data science in small business. The demand of small business for data analytics are in conjunction with the growing number of freelance data science consulting opportunities; hence this book will provide insight on how to navigate this new terrain.This book uses a do-it-yourself approach to analytics and introduces tools that are easily available online and are non-programming based. Data science will allow SMEs to understand their customer loyalty, market segmentation, sales and revenue increase etc. more clearly. Data Science and Analytics for SMEs is particularly focused on small businesses and explores the analytics and data that can help them succeed further in their business. What You'll LearnCreate and measure the success of their analytics projectStart your business analytics consulting careerUse solutions taught in the book in practical uses cases and problems Who This Book Is ForBusiness analytics enthusiasts who are not particularly programming inclined, small business owners and data science consultants, data science and business students, and SME (small-to-medium enterprise) analysts

Data Science and Analytics Strategy: An Emergent Design Approach (Chapman & Hall/CRC Data Science Series)

by Kailash Awati Alexander Scriven

This book describes how to establish data science and analytics capabilities in organisations using Emergent Design, an evolutionary approach that increases the chances of successful outcomes while minimising upfront investment. Based on their experiences and those of a number of data leaders, the authors provide actionable advice on data technologies, processes, and governance structures so that readers can make choices that are appropriate to their organisational contexts and requirements. The book blends academic research on organisational change and data science processes with real-world stories from experienced data analytics leaders, focusing on the practical aspects of setting up a data capability. In addition to a detailed coverage of capability, culture, and technology choices, a unique feature of the book is its treatment of emerging issues such as data ethics and algorithmic fairness. Data Science and Analytics Strategy: An Emergent Design Approach has been written for professionals who are looking to build data science and analytics capabilities within their organisations as well as those who wish to expand their knowledge and advance their careers in the data space. Providing deep insights into the intersection between data science and business, this guide will help professionals understand how to help their organisations reap the benefits offered by data. Most importantly, readers will learn how to build a fit-for-purpose data science capability in a manner that avoids the most common pitfalls.

Data Science and Applications for Modern Power Systems (Power Electronics and Power Systems)

by Le Xie Yang Weng Ram Rajagopal

This book offers a comprehensive collection of research articles that utilize data—in particular large data sets—in modern power systems operation and planning. As the power industry moves towards actively utilizing distributed resources with advanced technologies and incentives, it is becoming increasingly important to benefit from the available heterogeneous data sets for improved decision-making. The authors present a first-of-its-kind comprehensive review of big data opportunities and challenges in the smart grid industry. This book provides succinct and useful theory, practical algorithms, and case studies to improve power grid operations and planning utilizing big data, making it a useful graduate-level reference for students, faculty, and practitioners on the future grid.

Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data

by Emc Education Services

Data Science and Big Data Analytics is about harnessing the power of data for new insights. The book covers the breadth of activities and methods and tools that Data Scientists use. The content focuses on concepts, principles and practical applications that are applicable to any industry and technology environment, and the learning is supported and explained with examples that you can replicate using open-source software.This book will help you:Become a contributor on a data science teamDeploy a structured lifecycle approach to data analytics problemsApply appropriate analytic techniques and tools to analyzing big dataLearn how to tell a compelling story with data to drive business actionPrepare for EMC Proven Professional Data Science Certification

Data Science and Big Data Analytics in Smart Environments

by Marta Chinnici; Florin Pop; Cătălin Negru

Most applications generate large datasets, like social networking and social influence programs, smart cities applications, smart house environments, Cloud applications, public web sites, scientific experiments and simulations, data warehouse, monitoring platforms, and e-government services. Data grows rapidly, since applications produce continuously increasing volumes of both unstructured and structured data. Large-scale interconnected systems aim to aggregate and efficiently exploit the power of widely distributed resources. In this context, major solutions for scalability, mobility, reliability, fault tolerance and security are required to achieve high performance and to create a smart environment. The impact on data processing, transfer and storage is the need to re-evaluate the approaches and solutions to better answer the user needs. A variety of solutions for specific applications and platforms exist so a thorough and systematic analysis of existing solutions for data science, data analytics, methods and algorithms used in Big Data processing and storage environments is significant in designing and implementing a smart environment.Fundamental issues pertaining to smart environments (smart cities, ambient assisted leaving, smart houses, green houses, cyber physical systems, etc.) are reviewed. Most of the current efforts still do not adequately address the heterogeneity of different distributed systems, the interoperability between them, and the systems resilience. This book will primarily encompass practical approaches that promote research in all aspects of data processing, data analytics, data processing in different type of systems: Cluster Computing, Grid Computing, Peer-to-Peer, Cloud/Edge/Fog Computing, all involving elements of heterogeneity, having a large variety of tools and software to manage them. The main role of resource management techniques in this domain is to create the suitable frameworks for development of applications and deployment in smart environments, with respect to high performance. The book focuses on topics covering algorithms, architectures, management models, high performance computing techniques and large-scale distributed systems.

Data Science and Digital Business

by Fausto Pedro García Márquez Benjamin Lev

This book combines the analytic principles of digital business and data science with business practice and big data. The interdisciplinary, contributed volume provides an interface between the main disciplines of engineering and technology and business administration. Written for managers, engineers and researchers who want to understand big data and develop new skills that are necessary in the digital business, it not only discusses the latest research, but also presents case studies demonstrating the successful application of data in the digital business.

Data Science and Machine Learning: Mathematical and Statistical Methods (Chapman And Hall/crc Machine Learning And Pattern Recognition Ser.)

by Dirk P. Kroese Zdravko Botev Thomas Taimre Radislav Vaisman

"This textbook is a well-rounded, rigorous, and informative work presenting the mathematics behind modern machine learning techniques. It hits all the right notes: the choice of topics is up-to-date and perfect for a course on data science for mathematics students at the advanced undergraduate or early graduate level. This book fills a sorely-needed gap in the existing literature by not sacrificing depth for breadth, presenting proofs of major theorems and subsequent derivations, as well as providing a copious amount of Python code. I only wish a book like this had been around when I first began my journey!" -Nicholas Hoell, University of Toronto "This is a well-written book that provides a deeper dive into data-scientific methods than many introductory texts. The writing is clear, and the text logically builds up regularization, classification, and decision trees. Compared to its probable competitors, it carves out a unique niche. -Adam Loy, Carleton College The purpose of Data Science and Machine Learning: Mathematical and Statistical Methods is to provide an accessible, yet comprehensive textbook intended for students interested in gaining a better understanding of the mathematics and statistics that underpin the rich variety of ideas and machine learning algorithms in data science. Key Features: Focuses on mathematical understanding. Presentation is self-contained, accessible, and comprehensive. Extensive list of exercises and worked-out examples. Many concrete algorithms with Python code. Full color throughout. Further Resources can be found on the authors website: https://github.com/DSML-book/Lectures

Data Science and Machine Learning Applications in Subsurface Engineering

by Daniel Asante Otchere

This book covers unsupervised learning, supervised learning, clustering approaches, feature engineering, explainable AI and multioutput regression models for subsurface engineering problems. Processing voluminous and complex data sets are the primary focus of the field of machine learning (ML). ML aims to develop data-driven methods and computational algorithms that can learn to identify complex and non-linear patterns to understand and predict the relationships between variables by analysing extensive data. Although ML models provide the final output for predictions, several steps need to be performed to achieve accurate predictions. These steps, data pre-processing, feature selection, feature engineering and outlier removal, are all contained in this book. New models are also developed using existing ML architecture and learning theories to improve the performance of traditional ML models and handle small and big data without manual adjustments. This research-oriented book will help subsurface engineers, geophysicists, and geoscientists become familiar with data science and ML advances relevant to subsurface engineering. Additionally, it demonstrates the use of data-driven approaches for salt identification, seismic interpretation, estimating enhanced oil recovery factor, predicting pore fluid types, petrophysical property prediction, estimating pressure drop in pipelines, bubble point pressure prediction, enhancing drilling mud loss, smart well completion and synthetic well log predictions.

Data Science and Machine Learning for Non-Programmers: Using SAS Enterprise Miner (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)

by Dothang Truong

As data continues to grow exponentially, knowledge of data science and machine learning has become more crucial than ever. Machine learning has grown exponentially; however, the abundance of resources can be overwhelming, making it challenging for new learners. This book aims to address this disparity and cater to learners from various non-technical fields, enabling them to utilize machine learning effectively. Adopting a hands-on approach, readers are guided through practical implementations using real datasets and SAS Enterprise Miner, a user-friendly data mining software that requires no programming. Throughout the chapters, two large datasets are used consistently, allowing readers to practice all stages of the data mining process within a cohesive project framework. This book also provides specific guidelines and examples on presenting data mining results and reports, enhancing effective communication with stakeholders. Designed as a guiding companion for both beginners and experienced practitioners, this book targets a wide audience, including students, lecturers, researchers, and industry professionals from various backgrounds.

Data Science and Multiple Criteria Decision Making Approaches in Finance: Applications and Methods (Multiple Criteria Decision Making)

by Gökhan Silahtaroğlu Hasan Dinçer Serhat Yüksel

This book considers and assesses essential financial issues by utilizing data science and fuzzy multiple criteria decision making (MCDM) methods. It introduces readers to a range of data science methods, and demonstrates their application in the fields of business, health, economics, finance and engineering. In addition, it provides suggestions based on the assessment results on each topic, which can help to enhance the efficiency of the financial system and the sustainability of economic development. Given its scope, the book will help readers broaden their perspective on the assessment and evaluation of financial issues using data science and MCDM approaches.

Data Science and Predictive Analytics: Biomedical And Health Applications Using R

by Ivo D. Dinov

Over the past decade, Big Data have become ubiquitous in all economic sectors, scientific disciplines, and human activities. They have led to striking technological advances, affecting all human experiences. Our ability to manage, understand, interrogate, and interpret such extremely large, multisource, heterogeneous, incomplete, multiscale, and incongruent data has not kept pace with the rapid increase of the volume, complexity and proliferation of the deluge of digital information. There are three reasons for this shortfall. First, the volume of data is increasing much faster than the corresponding rise of our computational processing power (Kryder’s law > Moore’s law). Second, traditional discipline-bounds inhibit expeditious progress. Third, our education and training activities have fallen behind the accelerated trend of scientific, information, and communication advances. There are very few rigorous instructional resources, interactive learning materials, and dynamic training environments that support active data science learning. The textbook balances the mathematical foundations with dexterous demonstrations and examples of data, tools, modules and workflows that serve as pillars for the urgently needed bridge to close that supply and demand predictive analytic skills gap. Exposing the enormous opportunities presented by the tsunami of Big data, this textbook aims to identify specific knowledge gaps, educational barriers, and workforce readiness deficiencies. Specifically, it focuses on the development of a transdisciplinary curriculum integrating modern computational methods, advanced data science techniques, innovative biomedical applications, and impactful health analytics. The content of this graduate-level textbook fills a substantial gap in integrating modern engineering concepts, computational algorithms, mathematical optimization, statistical computing and biomedical inference. Big data analytic techniques and predictive scientific methods demand broad transdisciplinary knowledge, appeal to an extremely wide spectrum of readers/learners, and provide incredible opportunities for engagement throughout the academy, industry, regulatory and funding agencies. The two examples below demonstrate the powerful need for scientific knowledge, computational abilities, interdisciplinary expertise, and modern technologies necessary to achieve desired outcomes (improving human health and optimizing future return on investment). This can only be achieved by appropriately trained teams of researchers who can develop robust decision support systems using modern techniques and effective end-to-end protocols, like the ones described in this textbook. • A geriatric neurologist is examining a patient complaining of gait imbalance and posture instability. To determine if the patient may suffer from Parkinson’s disease, the physician acquires clinical, cognitive, phenotypic, imaging, and genetics data (Big Data). Most clinics and healthcare centers are not equipped with skilled data analytic teams that can wrangle, harmonize and interpret such complex datasets. A learner that completes a course of study using this textbook will have the competency and ability to manage the data, generate a protocol for deriving biomarkers, and provide an actionable decision support system. The results of this protocol will help the physician understand the entire patient dataset and assist in making a holistic evidence-based, data-driven, clinical diagnosis.• To improve the return on investment for their shareholders, a healthcare manufacturer needs to forecast the demand for their product subject to environmental, demographic, economic, and bio-social sentiment data (Big Data). The organization’s data-analytics team is tasked with developing a protocol that identifies, aggregates, harmonizes, models and analyzes these heterogeneous data elements to generate a trend forecast. This system needs to provide an automated, adaptive, scalable, and reliable prediction of the optimal investment, e.g., R&D allocation, that maximizes the company’s bot

Data Science and Productivity Analytics (International Series in Operations Research & Management Science #290)

by Vincent Charles Juan Aparicio Joe Zhu

This book includes a spectrum of concepts, such as performance, productivity, operations research, econometrics, and data science, for the practically and theoretically important areas of ‘productivity analysis/data envelopment analysis’ and ‘data science/big data’. Data science is defined as the collection of scientific methods, processes, and systems dedicated to extracting knowledge or insights from data and it develops on concepts from various domains, containing mathematics and statistical methods, operations research, machine learning, computer programming, pattern recognition, and data visualisation, among others. Examples of data science techniques include linear and logistic regressions, decision trees, Naïve Bayesian classifier, principal component analysis, neural networks, predictive modelling, deep learning, text analysis, survival analysis, and so on, all of which allow using the data to make more intelligent decisions. On the other hand, it is without a doubt that nowadays the amount of data is exponentially increasing, and analysing large data sets has become a key basis of competition and innovation, underpinning new waves of productivity growth. This book aims to bring a fresh look onto the various ways that data science techniques could unleash value and drive productivity from these mountains of data.Researchers working in productivity analysis/data envelopment analysis will benefit from learning about the tools available in data science/big data that can be used in their current research analyses and endeavours. The data scientists, on the other hand, will also get benefit from learning about the plethora of applications available in productivity analysis/data envelopment analysis.

Data Science and SDGs: Challenges, Opportunities and Realities

by Bikas Kumar Sinha Md. Nurul Haque Mollah

The book presents contributions on statistical models and methods applied, for both data science and SDGs, in one place. Measuring and controlling data of SDGs, data driven measurement of progress needs to be distributed to stakeholders. In this situation, the techniques used in data science, specially, in the big data analytics, play an important role rather than the traditional data gathering and manipulation techniques. This book fills this space through its twenty contributions. The contributions have been selected from those presented during the 7th International Conference on Data Science and Sustainable Development Goals organized by the Department of Statistics, University of Rajshahi, Bangladesh; and cover topics mainly on SDGs, bioinformatics, public health, medical informatics, environmental statistics, data science and machine learning. The contents of the volume would be useful to policymakers, researchers, government entities, civil society, and nonprofit organizations for monitoring and accelerating the progress of SDGs.

Data Science anwenden: Einführung, Anwendungen und Projekte (Angewandte Wirtschaftsinformatik)

by Thomas Barton Christian Müller

Dieses Buch bietet einen Einstieg in das Thema Data Science auf Basis der visuellen Aufbereitung von Daten. Es hat ethische Betrachtungen in der digitalen Transformation zum Gegenstand und stellt ein Prozessrahmenwerk für die Bewertung von Technologien vor. Außerdem erläutert es Besonderheiten und Erkenntnisse zum Scheitern von Data-Science-Projekten und stellt Empfehlungssysteme unter Berücksichtigung aktueller Entwicklungen vor. Funktionalität zu Machine Learning in Werkzeugen zu Business Analytics wird verglichen und der Einsatz eines Vorgehensmodells für Data Science aufgezeigt.Die Integration erneuerbarer Energien am Beispiel von Photovoltaikanlagen, ein effizienterer Umgang mit Wärmeenergie, wissenschaftliche Literaturauswertung, Kundenzufriedenheit in der Automobilindustrie und ein Framework für die Analyse von Fahrzeugdaten dienen als Anwendungsbeispiele für den konkreten Einsatz von Data Science. Das Buch bietet wichtige Informationen, die für Praktiker ebenso relevant sind wie für Studierende und Lehrende.

Data Science at Target

by Caitlin N. Bowler Srikant M. Datar

<p>Paritosh Desai joined Target.com in 2013 as VP of Business Intelligence, Analytics & Testing to explore how the retailer could use its relatively small but thriving e-commerce arm to drive sales and win customers. The case explores the technological and organizational challenges Desai faced and the trade offs he considered in his four-year journey to develop the larger retail business into a data science organization. <p>Professor Srikant M. Datar and Research Associate Caitlin N. Bowler prepared this case. It was reviewed and approved before publication by a company designate. Funding for the development of this case was provided by Harvard Business School and not by the company. The citation review for this case has not yet been completed. HBS cases are developed solely as the basis for class discussion. Cases are not intended to serve as endorsements, sources of primary data, or illustrations of effective or ineffective management.</p>

Data Science at the Warriors

by Iavor Bojinov Michael Parzen

An introductory case for a data science course, which provides an overview of the data science pipeline.

Data Science Concepts and Techniques with Applications

by Usman Qamar Muhammad Summair Raza

This book comprehensively covers the topic of data science. Data science is an umbrella term that encompasses data analytics, data mining, machine learning, and several other related disciplines. This book synthesizes both fundamental and advanced topics of a research area that has now reached maturity. The chapters of this book are organized into three sections:The first section is an introduction to data science. Starting from the basic concepts, the book will highlight the types of data, its use, its importance and issues that are normally faced in data analytics. Followed by discussion on wide range of applications of data science and widely used techniques in data science.The second section is devoted to the tools and techniques of data science. It consists of data pre-processing, feature selection, classification and clustering concepts as well as an introduction to text mining and opining mining.And finally, the third section of the book focuses on two programming languages commonly used for data science projects i.e. Python and R programming language.Although this book primarily serves as a textbook, it will also appeal to industrial practitioners and researchers due to its focus on applications and references. The book is suitable for both undergraduate and postgraduate students as well as those carrying out research in data science. It can be used as a textbook for undergraduate students in computer science, engineering and mathematics. It can also be accessible to undergraduate students from other areas with the adequate background. The more advanced chapters can be used by postgraduate researchers intending to gather a deeper theoretical understanding.

Data Science Essentials in Python: Collect - Organize - Explore - Predict - Value

by Dmitry Zinoviev

Go from messy, unstructured artifacts stored in SQL and NoSQL databases to a neat, well-organized dataset with this quick reference for the busy data scientist. Understand text mining, machine learning, and network analysis; process numeric data with the NumPy and Pandas modules; describe and analyze data using statistical and network-theoretical methods; and see actual examples of data analysis at work. This one-stop solution covers the essential data science you need in Python.Data science is one of the fastest-growing disciplines in terms of academic research, student enrollment, and employment. Python, with its flexibility and scalability, is quickly overtaking the R language for data-scientific projects. Keep Python data-science concepts at your fingertips with this modular, quick reference to the tools used to acquire, clean, analyze, and store data.This one-stop solution covers essential Python, databases, network analysis, natural language processing, elements of machine learning, and visualization. Access structured and unstructured text and numeric data from local files, databases, and the Internet. Arrange, rearrange, and clean the data. Work with relational and non-relational databases, data visualization, and simple predictive analysis (regressions, clustering, and decision trees). See how typical data analysis problems are handled. And try your hand at your own solutions to a variety of medium-scale projects that are fun to work on and look good on your resume.Keep this handy quick guide at your side whether you're a student, an entry-level data science professional converting from R to Python, or a seasoned Python developer who doesn't want to memorize every function and option.What You Need:You need a decent distribution of Python 3.3 or above that includes at least NLTK, Pandas, NumPy, Matplotlib, Networkx, SciKit-Learn, and BeautifulSoup. A great distribution that meets the requirements is Anaconda, available for free from www.continuum.io. If you plan to set up your own database servers, you also need MySQL (www.mysql.com) and MongoDB (www.mongodb.com). Both packages are free and run on Windows, Linux, and Mac OS.

Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking

by Tom Fawcett Foster Provost

Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You'll not only learn how to improve communication between business stakeholders and data scientists, but also how participate intelligently in your company's data science projects. You'll also discover how to think data-analytically, and fully appreciate how data science methods can support business decision-making.Understand how data science fits in your organization--and how you can use it for competitive advantageTreat data as a business asset that requires careful investment if you're to gain real valueApproach business problems data-analytically, using the data-mining process to gather good data in the most appropriate wayLearn general concepts for actually extracting knowledge from dataApply data science principles when interviewing data science job candidates

Data Science for Business With R

by Jeffrey S. Saltz Jeffrey Morgan Stanton

Data Science for Business with R, written by Jeffrey S. Saltz and Jeffrey M. Stanton, focuses on the concepts foundational for students starting a business analytics or data science degree program. To keep the book practical and applied, the authors feature a running case using a global airline business’s customer survey dataset to illustrate how to turn data in business decisions, in addition to numerous examples throughout. To aid in usability beyond the classroom, the text features full integration of freely-available R and RStudio software, one of the most popular data science tools available. Designed for students with little to no experience in related areas like computer science, the book chapters follow a logical order from introduction and installation of R and RStudio, working with data architecture, undertaking data collection, performing data analysis, and transitioning to data archiving and presentation. Each chapter follows a familiar structure, starting with learning objectives and background, following the basic steps of functions alongside simple examples, applying these functions to the case study, and ending with chapter challenge questions, sources, and a list of R functions so students know what to expect in each step of their data science course. Data Science for Business with R provides readers with a straightforward and applied guide to this new and evolving field.

Refine Search

Showing 25,076 through 25,100 of 100,000 results