Browse Results

Showing 15,026 through 15,050 of 58,307 results

Data Science and Information Security: First International Artificial Intelligence Conference, IAIC 2023, Nanjing, China, November 25–27, 2023, Revised Selected Papers, Part II (Communications in Computer and Information Science #2059)

by Hai Jin Yi Pan Jianfeng Lu

This 3-volume set, CCIS 2058-2060 constitutes the First International Conference, on Artificial Intelligence, IAIC 2023, held in Nanjing, China, in November 2023. The 85 full papers presented were carefully reviewed and selected from 428 submissions. The papers are clustered in parts on: Artificial Intelligence and Machine Learning; Data Security and information Security; Computer Networks and IoT. The papers present recent research and developments in artificial intelligence and its applications in machine learning, natural language processing, computer vision, robotics, and ethical considerations.

Data Science and Innovations for Intelligent Systems: Computational Excellence and Society 5.0 (Demystifying Technologies for Computational Excellence)

by Kavita Taneja

Data science is an emerging field and innovations in it need to be explored for the success of society 5.0. This book not only focuses on the practical applications of data science to achieve computational excellence, but also digs deep into the issues and implications of intelligent systems. This book highlights innovations in data science to achieve computational excellence that can optimize performance of smart applications. The book focuses on methodologies, framework, design issues, tools, architectures, and technologies necessary to develop and understand data science and its emerging applications in the present era. This book will be useful for the research community, start-up entrepreneurs, academicians, and data centered industries and professors that are interested in exploring innovations in varied applications and areas of data science.

Data Science and Intelligent Applications: Proceedings of ICDSIA 2020 (Lecture Notes on Data Engineering and Communications Technologies #52)

by Ketan Kotecha Vincenzo Piuri Hetalkumar N. Shah Rajan Patel

This book includes selected papers from the International Conference on Data Science and Intelligent Applications (ICDSIA 2020), hosted by Gandhinagar Institute of Technology (GIT), Gujarat, India, on January 24–25, 2020. The proceedings present original and high-quality contributions on theory and practice concerning emerging technologies in the areas of data science and intelligent applications. The conference provides a forum for researchers from academia and industry to present and share their ideas, views and results, while also helping them approach the challenges of technological advancements from different viewpoints. The contributions cover a broad range of topics, including: collective intelligence, intelligent systems, IoT, fuzzy systems, Bayesian networks, ant colony optimization, data privacy and security, data mining, data warehousing, big data analytics, cloud computing, natural language processing, swarm intelligence, speech processing, machine learning and deep learning, and intelligent applications and systems. Helping strengthen the links between academia and industry, the book offers a valuable resource for instructors, students, industry practitioners, engineers, managers, researchers, and scientists alike.

Data Science and Intelligent Systems: Proceedings of 5th Computational Methods in Systems and Software 2021, Vol. 2 (Lecture Notes in Networks and Systems #231)

by Radek Silhavy Petr Silhavy Zdenka Prokopova

This book constitutes the second part of refereed proceedings of the 5th Computational Methods in Systems and Software 2021 (CoMeSySo 2021) proceedings. The real-world problems related to data science and algorithm design related to systems and software engineering are presented in this papers. Furthermore, the basic research’ papers that describe novel approaches in the data science, algorithm design and in systems and software engineering are included.The CoMeSySo 2021 conference is breaking the barriers, being held online. CoMeSySo 2021 intends to provide an international forum for the discussion of the latest high-quality research results

Data Science and Internet of Things: Research and Applications at the Intersection of DS and IoT (Internet of Things)

by Giancarlo Fortino Antonio Liotta Raffaele Gravina Alessandro Longheu

This book focuses on the combination of IoT and data science, in particular how methods, algorithms, and tools from data science can effectively support IoT. The authors show how data science methodologies, techniques and tools, can translate data into information, enabling the effectiveness and usefulness of new services offered by IoT stakeholders. The authors posit that if IoT is indeed the infrastructure of the future, data structure is the key that can lead to a significant improvement of human life. The book aims to present innovative IoT applications as well as ongoing research that exploit modern data science approaches. Readers are offered issues and challenges in a cross-disciplinary scenario that involves both IoT and data science fields. The book features contributions from academics, researchers, and professionals from both fields.

Data Science and Its Applications

by Aakanksha Sharaff G. R. Sinha

The term "data" being mostly used, experimented, analyzed, and researched, "Data Science and its Applications" finds relevance in all domains of research studies including science, engineering, technology, management, mathematics, and many more in wide range of applications such as sentiment analysis, social medial analytics, signal processing, gene analysis, market analysis, healthcare, bioinformatics etc. The book on Data Science and its applications discusses about data science overview, scientific methods, data processing, extraction of meaningful information from data, and insight for developing the concept from different domains, highlighting mathematical and statistical models, operations research, computer programming, machine learning, data visualization, pattern recognition and others. The book also highlights data science implementation and evaluation of performance in several emerging applications such as information retrieval, cognitive science, healthcare, and computer vision. The data analysis covers the role of data science depicting different types of data such as text, image, biomedical signal etc. useful for a wide range of real time applications. The salient features of the book are: Overview, Challenges and Opportunities in Data Science and Real Time Applications Addressing Big Data Issues Useful Machine Learning Methods Disease Detection and Healthcare Applications utilizing Data Science Concepts and Deep Learning Applications in Stock Market, Education, Behavior Analysis, Image Captioning, Gene Analysis and Scene Text Analysis Data Optimization Due to multidisciplinary applications of data science concepts, the book is intended for wide range of readers that include Data Scientists, Big Data Analysists, Research Scholars engaged in Data Science and Machine Learning applications.

Data Science and Machine Learning: 21st Australasian Conference, AusDM 2023, Auckland, New Zealand, December 11–13, 2023, Proceedings (Communications in Computer and Information Science #1943)

by Diana Benavides-Prado Sarah Erfani Philippe Fournier-Viger Yee Ling Boo Yun Sing Koh

This book constitutes the proceedings of the 21st Australasian Conference on Data Science and Machine Learning, AusDM 2023, held in Auckland, New Zealand, during December 11–13, 2023.The 20 full papers presented in this book were carefully reviewed and selected from 50 submissions. The papers are organized in the following topical sections: research track and application track. They deal with topics around data science and machine learning in everyday life.

Data Science and Machine Learning: Mathematical and Statistical Methods (Chapman And Hall/crc Machine Learning And Pattern Recognition Ser.)

by Dirk P. Kroese Zdravko Botev Thomas Taimre Radislav Vaisman

"This textbook is a well-rounded, rigorous, and informative work presenting the mathematics behind modern machine learning techniques. It hits all the right notes: the choice of topics is up-to-date and perfect for a course on data science for mathematics students at the advanced undergraduate or early graduate level. This book fills a sorely-needed gap in the existing literature by not sacrificing depth for breadth, presenting proofs of major theorems and subsequent derivations, as well as providing a copious amount of Python code. I only wish a book like this had been around when I first began my journey!" -Nicholas Hoell, University of Toronto "This is a well-written book that provides a deeper dive into data-scientific methods than many introductory texts. The writing is clear, and the text logically builds up regularization, classification, and decision trees. Compared to its probable competitors, it carves out a unique niche. -Adam Loy, Carleton College The purpose of Data Science and Machine Learning: Mathematical and Statistical Methods is to provide an accessible, yet comprehensive textbook intended for students interested in gaining a better understanding of the mathematics and statistics that underpin the rich variety of ideas and machine learning algorithms in data science. Key Features: Focuses on mathematical understanding. Presentation is self-contained, accessible, and comprehensive. Extensive list of exercises and worked-out examples. Many concrete algorithms with Python code. Full color throughout. Further Resources can be found on the authors website: https://github.com/DSML-book/Lectures

Data Science and Machine Learning for Non-Programmers: Using SAS Enterprise Miner (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)

by Dothang Truong

As data continues to grow exponentially, knowledge of data science and machine learning has become more crucial than ever. Machine learning has grown exponentially; however, the abundance of resources can be overwhelming, making it challenging for new learners. This book aims to address this disparity and cater to learners from various non-technical fields, enabling them to utilize machine learning effectively. Adopting a hands-on approach, readers are guided through practical implementations using real datasets and SAS Enterprise Miner, a user-friendly data mining software that requires no programming. Throughout the chapters, two large datasets are used consistently, allowing readers to practice all stages of the data mining process within a cohesive project framework. This book also provides specific guidelines and examples on presenting data mining results and reports, enhancing effective communication with stakeholders. Designed as a guiding companion for both beginners and experienced practitioners, this book targets a wide audience, including students, lecturers, researchers, and industry professionals from various backgrounds.

Data Science and Medical Informatics in Healthcare Technologies (SpringerBriefs in Applied Sciences and Technology)

by Nguyen Thi Dieu Linh Zhongyu (Joan) Lu

This book highlights a timely and accurate insight at the endeavour of the bioinformatics and genomics clinicians from industry and academia to address the societal needs. The contents of the book unearth the lacuna between the medication and treatment in the current preventive medicinal and pharmaceutical system. It contains chapters prepared by experts in life sciences along with data scientists for examining the circumstances of health care system for the next decade. It also highlights the automated processes for analyzing data in clinical trial research, specifically for drug development. Additionally, the data science solutions provided in this book help pharmaceutical companies to improve on what had historically been manual, costly and laborious process for cross-referencing research in clinical trials on drug development, while laying the groundwork for use with a full range of other drugs for the conditions ranging from tuberculosis, to diabetes, to heart attacks and many others.

Data Science and Multiple Criteria Decision Making Approaches in Finance: Applications and Methods (Multiple Criteria Decision Making)

by Gökhan Silahtaroğlu Hasan Dinçer Serhat Yüksel

This book considers and assesses essential financial issues by utilizing data science and fuzzy multiple criteria decision making (MCDM) methods. It introduces readers to a range of data science methods, and demonstrates their application in the fields of business, health, economics, finance and engineering. In addition, it provides suggestions based on the assessment results on each topic, which can help to enhance the efficiency of the financial system and the sustainability of economic development. Given its scope, the book will help readers broaden their perspective on the assessment and evaluation of financial issues using data science and MCDM approaches.

Data Science and Network Engineering: Proceedings of ICDSNE 2023 (Lecture Notes in Networks and Systems #791)

by Suyel Namasudra Munesh Chandra Trivedi Ruben Gonzalez Crespo Pascal Lorenz

This book includes research papers presented at the International Conference on Data Science and Network Engineering (ICDSNE 2023) organized by the Department of Computer Science and Engineering, National Institute of Technology Agartala, Tripura, India, during July 21–22, 2023. It includes research works from researchers, academicians, business executives, and industry professionals for solving real-life problems by using the advancements and applications of data science and network engineering. This book covers many advanced topics, such as artificial intelligence (AI), machine learning (ML), deep learning (DL), computer networks, blockchain, security and privacy, Internet of things (IoT), cloud computing, big data, supply chain management, and many more. Different sections of this book are highly beneficial for the researchers, who are working in the field of data science and network engineering.

Data Science and Predictive Analytics: Biomedical And Health Applications Using R

by Ivo D. Dinov

Over the past decade, Big Data have become ubiquitous in all economic sectors, scientific disciplines, and human activities. They have led to striking technological advances, affecting all human experiences. Our ability to manage, understand, interrogate, and interpret such extremely large, multisource, heterogeneous, incomplete, multiscale, and incongruent data has not kept pace with the rapid increase of the volume, complexity and proliferation of the deluge of digital information. There are three reasons for this shortfall. First, the volume of data is increasing much faster than the corresponding rise of our computational processing power (Kryder’s law > Moore’s law). Second, traditional discipline-bounds inhibit expeditious progress. Third, our education and training activities have fallen behind the accelerated trend of scientific, information, and communication advances. There are very few rigorous instructional resources, interactive learning materials, and dynamic training environments that support active data science learning. The textbook balances the mathematical foundations with dexterous demonstrations and examples of data, tools, modules and workflows that serve as pillars for the urgently needed bridge to close that supply and demand predictive analytic skills gap. Exposing the enormous opportunities presented by the tsunami of Big data, this textbook aims to identify specific knowledge gaps, educational barriers, and workforce readiness deficiencies. Specifically, it focuses on the development of a transdisciplinary curriculum integrating modern computational methods, advanced data science techniques, innovative biomedical applications, and impactful health analytics. The content of this graduate-level textbook fills a substantial gap in integrating modern engineering concepts, computational algorithms, mathematical optimization, statistical computing and biomedical inference. Big data analytic techniques and predictive scientific methods demand broad transdisciplinary knowledge, appeal to an extremely wide spectrum of readers/learners, and provide incredible opportunities for engagement throughout the academy, industry, regulatory and funding agencies. The two examples below demonstrate the powerful need for scientific knowledge, computational abilities, interdisciplinary expertise, and modern technologies necessary to achieve desired outcomes (improving human health and optimizing future return on investment). This can only be achieved by appropriately trained teams of researchers who can develop robust decision support systems using modern techniques and effective end-to-end protocols, like the ones described in this textbook. • A geriatric neurologist is examining a patient complaining of gait imbalance and posture instability. To determine if the patient may suffer from Parkinson’s disease, the physician acquires clinical, cognitive, phenotypic, imaging, and genetics data (Big Data). Most clinics and healthcare centers are not equipped with skilled data analytic teams that can wrangle, harmonize and interpret such complex datasets. A learner that completes a course of study using this textbook will have the competency and ability to manage the data, generate a protocol for deriving biomarkers, and provide an actionable decision support system. The results of this protocol will help the physician understand the entire patient dataset and assist in making a holistic evidence-based, data-driven, clinical diagnosis.• To improve the return on investment for their shareholders, a healthcare manufacturer needs to forecast the demand for their product subject to environmental, demographic, economic, and bio-social sentiment data (Big Data). The organization’s data-analytics team is tasked with developing a protocol that identifies, aggregates, harmonizes, models and analyzes these heterogeneous data elements to generate a trend forecast. This system needs to provide an automated, adaptive, scalable, and reliable prediction of the optimal investment, e.g., R&D allocation, that maximizes the company’s bot

Data Science and Predictive Analytics: Biomedical and Health Applications using R (The Springer Series in Applied Machine Learning)

by Ivo D. Dinov

This textbook integrates important mathematical foundations, efficient computational algorithms, applied statistical inference techniques, and cutting-edge machine learning approaches to address a wide range of crucial biomedical informatics, health analytics applications, and decision science challenges. Each concept in the book includes a rigorous symbolic formulation coupled with computational algorithms and complete end-to-end pipeline protocols implemented as functional R electronic markdown notebooks. These workflows support active learning and demonstrate comprehensive data manipulations, interactive visualizations, and sophisticated analytics. The content includes open problems, state-of-the-art scientific knowledge, ethical integration of heterogeneous scientific tools, and procedures for systematic validation and dissemination of reproducible research findings.Complementary to the enormous challenges related to handling, interrogating, and understanding massive amounts of complex structured and unstructured data, there are unique opportunities that come with access to a wealth of feature-rich, high-dimensional, and time-varying information. The topics covered in Data Science and Predictive Analytics address specific knowledge gaps, resolve educational barriers, and mitigate workforce information-readiness and data science deficiencies. Specifically, it provides a transdisciplinary curriculum integrating core mathematical principles, modern computational methods, advanced data science techniques, model-based machine learning, model-free artificial intelligence, and innovative biomedical applications. The book’s fourteen chapters start with an introduction and progressively build foundational skills from visualization to linear modeling, dimensionality reduction, supervised classification, black-box machine learning techniques, qualitative learning methods, unsupervised clustering, model performance assessment, feature selection strategies, longitudinal data analytics, optimization, neural networks, and deep learning. The second edition of the book includes additional learning-based strategies utilizing generative adversarial networks, transfer learning, and synthetic data generation, as well as eight complementary electronic appendices. This textbook is suitable for formal didactic instructor-guided course education, as well as for individual or team-supported self-learning. The material is presented at the upper-division and graduate-level college courses and covers applied and interdisciplinary mathematics, contemporary learning-based data science techniques, computational algorithm development, optimization theory, statistical computing, and biomedical sciences. The analytical techniques and predictive scientific methods described in the book may be useful to a wide range of readers, formal and informal learners, college instructors, researchers, and engineers throughout the academy, industry, government, regulatory, funding, and policy agencies. The supporting book website provides many examples, datasets, functional scripts, complete electronic notebooks, extensive appendices, and additional materials.

Data Science and SDGs: Challenges, Opportunities and Realities

by Bikas Kumar Sinha Md. Nurul Haque Mollah

The book presents contributions on statistical models and methods applied, for both data science and SDGs, in one place. Measuring and controlling data of SDGs, data driven measurement of progress needs to be distributed to stakeholders. In this situation, the techniques used in data science, specially, in the big data analytics, play an important role rather than the traditional data gathering and manipulation techniques. This book fills this space through its twenty contributions. The contributions have been selected from those presented during the 7th International Conference on Data Science and Sustainable Development Goals organized by the Department of Statistics, University of Rajshahi, Bangladesh; and cover topics mainly on SDGs, bioinformatics, public health, medical informatics, environmental statistics, data science and machine learning. The contents of the volume would be useful to policymakers, researchers, government entities, civil society, and nonprofit organizations for monitoring and accelerating the progress of SDGs.

Data Science and Security: Proceedings of IDSCS 2023 (Lecture Notes in Networks and Systems #922)

by Durgesh Kumar Mishra Hiroki Sayama Samiksha Shukla Joseph Varghese Kureethara

This book presents best-selected papers presented at the International Conference on Data Science for Computational Security (IDSCS 2023), organized by the Department of Data Science, CHRIST (Deemed to be University), Pune Lavasa Campus, India, from 02–04 November, 2023. The proceeding targets the current research works in the areas of data science, data security, data analytics, artificial intelligence, machine learning, computer vision, algorithms design, computer networking, data mining, big data, text mining, knowledge representation, soft computing, and cloud computing.

Data Science and Security: Proceedings of IDSCS 2020 (Lecture Notes in Networks and Systems #132)

by Durgesh Kumar Mishra Aynur Unal Dharm Singh Jat Samiksha Shukla

This book presents best selected papers presented at the International Conference on Data Science for Computational Security (IDSCS 2020), organized by the Department of Data Science, CHRIST (Deemed to be University), Pune Lavasa Campus, India, during 13–14 March 2020. The proceeding will be targeting the current research works in the areas of data science, data security, data analytics, artificial intelligence, machine learning, computer vision, algorithms design, computer networking, data mining, big data, text mining, knowledge representation, soft computing and cloud computing.

Data Science and Security: Proceedings of IDSCS 2021 (Lecture Notes in Networks and Systems #290)

by Durgesh Kumar Mishra Aynur Unal Samiksha Shukla Joseph Varghese Kureethara Dong Seog Han

This book presents the best-selected papers presented at the International Conference on Data Science, Computation and Security (IDSCS-2021), organized by the Department of Data Science, CHRIST (Deemed to be University), Pune Lavasa Campus, India, during April 16–17, 2021. The proceeding is targeting the current research works in the areas of data science, data security, data analytics, artificial intelligence, machine learning, computer vision, algorithms design, computer networking, data mining, big data, text mining, knowledge representation, soft computing, and cloud computing.

Data Science and Security: Proceedings of IDSCS 2022 (Lecture Notes in Networks and Systems #462)

by Samiksha Shukla Xiao-Zhi Gao Joseph Varghese Kureethara Durgesh Mishra

This book presents best selected papers presented at the International Conference on Data Science for Computational Security (IDSCS 2022), organized by the Department of Data Science, CHRIST (Deemed to be University), Pune Lavasa Campus, India, during 11 – 12 February 2022. The book proposes new technologies and discusses future solutions and applications of data science, data analytics and security. The book targets current research works in the areas of data science, data security, data analytics, artificial intelligence, machine learning, computer vision, algorithms design, computer networking, data mining, big data, text mining, knowledge representation, soft computing and cloud computing.

Data Science and Social Research II: Methods, Technologies and Applications (Studies in Classification, Data Analysis, and Knowledge Organization)

by Paolo Mariani Mariangela Zenga

The peer-reviewed contributions gathered in this book address methods, software and applications of statistics and data science in the social sciences. The data revolution in social science research has not only produced new business models, but has also provided policymakers with better decision-making support tools. In this volume, statisticians, computer scientists and experts on social research discuss the opportunities and challenges of the social data revolution in order to pave the way for addressing new research problems. The respective contributions focus on complex social systems and current methodological advances in extracting social knowledge from large data sets, as well as modern social research on human behavior and society using large data sets. Moreover, they analyze integrated systems designed to take advantage of new social data sources, and discuss quality-related issues. The papers were originally presented at the 2nd International Conference on Data Science and Social Research, held in Milan, Italy, on February 4-5, 2019.

Data Science and Visual Computing (Advanced Information and Knowledge Processing)

by Rae Earnshaw John Dill David Kasik

Data science addresses the need to extract knowledge and information from data volumes, often from real-time sources in a wide variety of disciplines such as astronomy, bioinformatics, engineering, science, medicine, social science, business, and the humanities. The range and volume of data sources has increased enormously over time, particularly those generating real-time data. This has posed additional challenges for data management and data analysis of the data and effective representation and display. A wide range of application areas are able to benefit from the latest visual tools and facilities. Rapid analysis is needed in areas where immediate decisions need to be made. Such areas include weather forecasting, the stock exchange, and security threats. In areas where the volume of data being produced far exceeds the current capacity to analyze all of it, attention is being focussed how best to address these challenges. Optimum ways of addressing large data sets across a variety of disciplines have led to the formation of national and institutional Data Science Institutes and Centers. Being driven by national priority, they are able to attract support for research and development within their organizations and institutions to bring together interdisciplinary expertise to address a wide variety of problems. Visual computing is a set of tools and methodologies that utilize 2D and 3D images to extract information from data. Such methods include data analysis, simulation, and interactive exploration. These are analyzed and discussed.

Data Science anwenden: Einführung, Anwendungen und Projekte (Angewandte Wirtschaftsinformatik)

by Thomas Barton Christian Müller

Dieses Buch bietet einen Einstieg in das Thema Data Science auf Basis der visuellen Aufbereitung von Daten. Es hat ethische Betrachtungen in der digitalen Transformation zum Gegenstand und stellt ein Prozessrahmenwerk für die Bewertung von Technologien vor. Außerdem erläutert es Besonderheiten und Erkenntnisse zum Scheitern von Data-Science-Projekten und stellt Empfehlungssysteme unter Berücksichtigung aktueller Entwicklungen vor. Funktionalität zu Machine Learning in Werkzeugen zu Business Analytics wird verglichen und der Einsatz eines Vorgehensmodells für Data Science aufgezeigt.Die Integration erneuerbarer Energien am Beispiel von Photovoltaikanlagen, ein effizienterer Umgang mit Wärmeenergie, wissenschaftliche Literaturauswertung, Kundenzufriedenheit in der Automobilindustrie und ein Framework für die Analyse von Fahrzeugdaten dienen als Anwendungsbeispiele für den konkreten Einsatz von Data Science. Das Buch bietet wichtige Informationen, die für Praktiker ebenso relevant sind wie für Studierende und Lehrende.

Data Science at Target

by Caitlin N. Bowler Srikant M. Datar

<p>Paritosh Desai joined Target.com in 2013 as VP of Business Intelligence, Analytics & Testing to explore how the retailer could use its relatively small but thriving e-commerce arm to drive sales and win customers. The case explores the technological and organizational challenges Desai faced and the trade offs he considered in his four-year journey to develop the larger retail business into a data science organization. <p>Professor Srikant M. Datar and Research Associate Caitlin N. Bowler prepared this case. It was reviewed and approved before publication by a company designate. Funding for the development of this case was provided by Harvard Business School and not by the company. The citation review for this case has not yet been completed. HBS cases are developed solely as the basis for class discussion. Cases are not intended to serve as endorsements, sources of primary data, or illustrations of effective or ineffective management.</p>

Data Science at the Command Line

by Jeroen Janssens

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data.To get you started--whether you're on Windows, OS X, or Linux--author Jeroen Janssens introduces the Data Science Toolbox, an easy-to-install virtual environment packed with over 80 command-line tools.Discover why the command line is an agile, scalable, and extensible technology. Even if you're already comfortable processing data with, say, Python or R, you'll greatly improve your data science workflow by also leveraging the power of the command line.Obtain data from websites, APIs, databases, and spreadsheetsPerform scrub operations on plain text, CSV, HTML/XML, and JSONExplore data, compute descriptive statistics, and create visualizationsManage your data science workflow using DrakeCreate reusable tools from one-liners and existing Python or R codeParallelize and distribute data-intensive pipelines using GNU ParallelModel data with dimensionality reduction, clustering, regression, and classification algorithms

Data Science at the Command Line

by Jeroen Janssens

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 80 tools--useful whether you work with Windows, macOS, or Linux.You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, and engineers; software and machine learning engineers; and system administrators.Obtain data from websites, APIs, databases, and spreadsheetsPerform scrub operations on text, CSV, HTM, XML, and JSON filesExplore data, compute descriptive statistics, and create visualizationsManage your data science workflowCreate reusable command-line tools from one-liners and existing Python or R codeParallelize and distribute data-intensive pipelinesModel data with dimensionality reduction, clustering, regression, and classification algorithms

Refine Search

Showing 15,026 through 15,050 of 58,307 results