- Table View
- List View
Statistical Language and Speech Processing: 9th International Conference, SLSP 2021, Virtual Event, November 22-26, 2021, Proceedings (Lecture Notes in Computer Science #13062)
by Luis Espinosa-Anke Carlos Martín-Vide Irena SpasićThis book constitutes the proceedings of the 9th International Conference on Statistical Language and Speech Processing, SLSP 2021, held in Cardiff, UK, in November 2021.The 9 full papers presented in this volume were carefully reviewed and selected from 21 submissions. The papers present topics of either theoretical or applied interest discussing the employment of statistical models (including machine learning) within language and speech processing.
Statistical Language and Speech Processing: 4th International Conference, SLSP 2016, Pilsen, Czech Republic, October 11-12, 2016, Proceedings (Lecture Notes in Computer Science #9918)
by Pavel Král Carlos Martín-VideThis book constitutes the refereed proceedings of the 4th International Conference on Statistical Language and Speech Processing, SLSP 2016, held in Pilsen, Czech Republic, in October 2016. The 11 full papers presented together with two invited talks were carefully reviewed and selected from 38 submissions. The papers cover topics such as anaphora and coreference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semantic web; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling.
Statistical Language and Speech Processing: 7th International Conference, SLSP 2019, Ljubljana, Slovenia, October 14–16, 2019, Proceedings (Lecture Notes in Computer Science #11816)
by Carlos Martín-Vide Matthew Purver Senja PollakThis book constitutes the proceedings of the 7th International Conference on Statistical Language and Speech Processing, SLSP 2019, held in Ljubljana, Slovenia, in October 2019. The 25 full papers presented together with one invited paper in this volume were carefully reviewed and selected from 48 submissions. They were organized in topical sections named: Dialogue and Spoken Language Understanding; Language Analysis and Generation; Speech Analysis and Synthesis; Speech Recognition; Text Analysis and Classification.
Statistical Language and Speech Processing: 5th International Conference, SLSP 2017, Le Mans, France, October 23–25, 2017, Proceedings (Lecture Notes in Computer Science #10583)
by Nathalie Camelin, Yannick Estève and Carlos Martín-VideThis book constitutes the refereed proceedings of the 5th International Conference on Statistical Language and Speech Processing, SLSP 2017, held in Le Mans, France, in October 2017. The 21 full papers presented were carefully reviewed and selected from 39 submissions. The papers cover topics such as anaphora and conference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semanticweb; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling. They are organized in the following sections: language and information extraction; post-processing and applications of automatic transcriptions; speech paralinguistics and synthesis; speech recognition: modeling and resources.
Statistical Laws in Complex Systems: Combining Mechanistic Models and Data Analysis (Understanding Complex Systems)
by Eduardo G. AltmannThis book provides a unifying approach to the study of statistical laws, critically evaluating their role in the theoretical understanding of complex systems and the different data-analysis methods used to evaluate them. Statistical laws describe regular patterns observed in diverse scientific domains, ranging from the magnitude of earthquakes (Gutenberg-Richter law) and metabolic rates in organisms (Kleiber's law), to the frequency distribution of words in texts (Zipf's and Herdan-Heaps' laws), and productivity metrics of cities (urban scaling laws). The origins of these laws, their empirical validity, and the insights they provide into underlying systems have been subjects of scientific inquiry for centuries. Through a historical review and a unified analysis, this book argues that the persistent controversies on the validity of statistical laws are predominantly rooted not in novel empirical findings but in the discordance among data-analysis techniques, mechanistic models, and the interpretations of statistical laws. Starting with simple examples and progressing to more advanced time-series and statistical methods, this book and its accompanying repository provide comprehensive material for researchers interested in analyzing data, testing and comparing different laws, and interpreting results in both existing and new datasets.
Statistical Learning and Data Science
by Fionn Murtagh Mireille Gettler Summa Léon Bottou Bernard Goldfarb Catherine Pardoux Myriam TouatiData analysis is changing fast. Driven by a vast range of application domains and affordable tools, machine learning has become mainstream. Unsupervised data analysis, including cluster analysis, factor analysis, and low dimensionality mapping methods continually being updated, have reached new heights of achievement in the incredibly rich data wor
Statistical Learning of Complex Data (Studies in Classification, Data Analysis, and Knowledge Organization)
by Francesca Greselin Laura Deldossi Luca Bagnato Maurizio VichiThis book of peer-reviewed contributions presents the latest findings in classification, statistical learning, data analysis and related areas, including supervised and unsupervised classification, clustering, statistical analysis of mixed-type data, big data analysis, statistical modeling, graphical models and social networks. It covers both methodological aspects as well as applications to a wide range of fields such as economics, architecture, medicine, data management, consumer behavior and the gender gap. In addition, it describes the basic features of the software behind the data analysis results, and provides links to the corresponding codes and data sets where necessary. This book is intended for researchers and practitioners who are interested in the latest developments and applications in the field of data analysis and classification. It gathers selected and peer-reviewed contributions presented at the 11th Scientific Meeting of the Classification and Data Analysis Group of the Italian Statistical Society (CLADAG 2017), held in Milan, Italy, on September 13–15, 2017.
Statistical Learning Tools for Electricity Load Forecasting (Statistics for Industry, Technology, and Engineering)
by Jean-Michel Poggi Anestis Antoniadis Jairo Cugliari Matteo Fasiolo Yannig GoudeThis monograph explores a set of statistical and machine learning tools that can be effectively utilized for applied data analysis in the context of electricity load forecasting. Drawing on their substantial research and experience with forecasting electricity demand in industrial settings, the authors guide readers through several modern forecasting methods and tools from both industrial and applied perspectives – generalized additive models (GAMs), probabilistic GAMs, functional time series and wavelets, random forests, aggregation of experts, and mixed effects models. A collection of case studies based on sizable high-resolution datasets, together with relevant R packages, then illustrate the implementation of these techniques. Five real datasets at three different levels of aggregation (nation-wide, region-wide, or individual) from four different countries (UK, France, Ireland, and the USA) are utilized to study five problems: short-term point-wise forecasting, selection of relevant variables for prediction, construction of prediction bands, peak demand prediction, and use of individual consumer data. This text is intended for practitioners, researchers, and post-graduate students working on electricity load forecasting; it may also be of interest to applied academics or scientists wanting to learn about cutting-edge forecasting tools for application in other areas. Readers are assumed to be familiar with standard statistical concepts such as random variables, probability density functions, and expected values, and to possess some minimal modeling experience.
Statistical Learning Using Neural Networks: A Guide for Statisticians and Data Scientists with Python
by Basilio de Braganca Pereira Calyampudi Radhakrishna Rao Fabio Borges de OliveiraStatistical Learning using Neural Networks: A Guide for Statisticians and Data Scientists with Python introduces artificial neural networks starting from the basics and increasingly demanding more effort from readers, who can learn the theory and its applications in statistical methods with concrete Python code examples. It presents a wide range of widely used statistical methodologies, applied in several research areas with Python code examples, which are available online. It is suitable for scientists and developers as well as graduate students.Key Features: Discusses applications in several research areas Covers a wide range of widely used statistical methodologies Includes Python code examples Gives numerous neural network models This book covers fundamental concepts on Neural Networks including Multivariate Statistics Neural Networks, Regression Neural Network Models, Survival Analysis Networks, Time Series Forecasting Networks, Control Chart Networks, and Statistical Inference Results. This book is suitable for both teaching and research. It introduces neural networks and is a guide for outsiders of academia working in data mining and artificial intelligence (AI). This book brings together data analysis from statistics to computer science using neural networks.
Statistical Learning with Math and Python: 100 Exercises for Building Logic
by Joe SuzukiThe most crucial ability for machine learning and data science is mathematical logic for grasping their essence rather than knowledge and experience. This textbook approaches the essence of machine learning and data science by considering math problems and building Python programs. As the preliminary part, Chapter 1 provides a concise introduction to linear algebra, which will help novices read further to the following main chapters. Those succeeding chapters present essential topics in statistical learning: linear regression, classification, resampling, information criteria, regularization, nonlinear regression, decision trees, support vector machines, and unsupervised learning. Each chapter mathematically formulates and solves machine learning problems and builds the programs. The body of a chapter is accompanied by proofs and programs in an appendix, with exercises at the end of the chapter. Because the book is carefully organized to provide the solutions to the exercises in each chapter, readers can solve the total of 100 exercises by simply following the contents of each chapter. This textbook is suitable for an undergraduate or graduate course consisting of about 12 lectures. Written in an easy-to-follow and self-contained style, this book will also be perfect material for independent learning.
Statistical Learning with Math and R: 100 Exercises for Building Logic
by Joe SuzukiThe most crucial ability for machine learning and data science is mathematical logic for grasping their essence rather than knowledge and experience. This textbook approaches the essence of machine learning and data science by considering math problems and building R programs. As the preliminary part, Chapter 1 provides a concise introduction to linear algebra, which will help novices read further to the following main chapters. Those succeeding chapters present essential topics in statistical learning: linear regression, classification, resampling, information criteria, regularization, nonlinear regression, decision trees, support vector machines, and unsupervised learning. Each chapter mathematically formulates and solves machine learning problems and builds the programs. The body of a chapter is accompanied by proofs and programs in an appendix, with exercises at the end of the chapter. Because the book is carefully organized to provide the solutions to the exercises in each chapter, readers can solve the total of 100 exercises by simply following the contents of each chapter. This textbook is suitable for an undergraduate or graduate course consisting of about 12 lectures. Written in an easy-to-follow and self-contained style, this book will also be perfect material for independent learning.
Statistical Learning with Sparsity: The Lasso and Generalizations (Chapman & Hall/CRC Monographs on Statistics and Applied Probability)
by Trevor Hastie Robert Tibshirani Martin WainwrightDiscover New Methods for Dealing with High-Dimensional DataA sparse statistical model has only a small number of nonzero parameters or weights; therefore, it is much easier to estimate and interpret than a dense model. Statistical Learning with Sparsity: The Lasso and Generalizations presents methods that exploit sparsity to help recover the underl
Statistical Machine Learning: A Unified Framework (Chapman & Hall/CRC Texts in Statistical Science)
by Richard GoldenThe recent rapid growth in the variety and complexity of new machine learning architectures requires the development of improved methods for designing, analyzing, evaluating, and communicating machine learning technologies. Statistical Machine Learning: A Unified Framework provides students, engineers, and scientists with tools from mathematical statistics and nonlinear optimization theory to become experts in the field of machine learning. In particular, the material in this text directly supports the mathematical analysis and design of old, new, and not-yet-invented nonlinear high-dimensional machine learning algorithms. Features: Unified empirical risk minimization framework supports rigorous mathematical analyses of widely used supervised, unsupervised, and reinforcement machine learning algorithms Matrix calculus methods for supporting machine learning analysis and design applications Explicit conditions for ensuring convergence of adaptive, batch, minibatch, MCEM, and MCMC learning algorithms that minimize both unimodal and multimodal objective functions Explicit conditions for characterizing asymptotic properties of M-estimators and model selection criteria such as AIC and BIC in the presence of possible model misspecification This advanced text is suitable for graduate students or highly motivated undergraduate students in statistics, computer science, electrical engineering, and applied mathematics. The text is self-contained and only assumes knowledge of lower-division linear algebra and upper-division probability theory. Students, professional engineers, and multidisciplinary scientists possessing these minimal prerequisites will find this text challenging yet accessible. About the Author: Richard M. Golden (Ph.D., M.S.E.E., B.S.E.E.) is Professor of Cognitive Science and Participating Faculty Member in Electrical Engineering at the University of Texas at Dallas. Dr. Golden has published articles and given talks at scientific conferences on a wide range of topics in the fields of both statistics and machine learning over the past three decades. His long-term research interests include identifying conditions for the convergence of deterministic and stochastic machine learning algorithms and investigating estimation and inference in the presence of possibly misspecified probability models.
Statistical Machine Learning for Engineering with Applications (Lecture Notes in Statistics #227)
by Jürgen Franke Anita SchöbelThis book offers a leisurely introduction to the concepts and methods of machine learning. Readers will learn about classification trees, Bayesian learning, neural networks and deep learning, the design of experiments, and related methods. For ease of reading, technical details are avoided as far as possible, and there is a particular emphasis on applicability, interpretation, reliability and limitations of the data-analytic methods in practice. To cover the common availability and types of data in engineering, training sets consisting of independent as well as time series data are considered. To cope with the scarceness of data in industrial problems, augmentation of training sets by additional artificial data, generated from physical models, as well as the combination of machine learning and expert knowledge of engineers are discussed. The methodological exposition is accompanied by several detailed case studies based on industrial projects covering a broad range of engineering applications from vehicle manufacturing, process engineering and design of materials to optimization of production processes based on image analysis. The focus is on fundamental ideas, applicability and the pitfalls of machine learning in industry and science, where data are often scarce. Requiring only very basic background in statistics, the book is ideal for self-study or short courses for engineering and science students.
Statistical Machine Translation
by Philipp KoehnThe filed of machine translation has recently been energized by the emergence of statistical techniques, which have brought the dream of automatic language translation closer to reality. This class-tested textbook, authored by an active researcher in the field, provides a gentle and accessible introduction to the latest methods and enables the reader to build machine translation systems for any language pair.
A Statistical Mechanical Interpretation of Algorithmic Information Theory (SpringerBriefs in Mathematical Physics #36)
by Kohtaro TadakiThis book is the first one that provides a solid bridge between algorithmic information theory and statistical mechanics. Algorithmic information theory (AIT) is a theory of program size and recently is also known as algorithmic randomness. AIT provides a framework for characterizing the notion of randomness for an individual object and for studying it closely and comprehensively. In this book, a statistical mechanical interpretation of AIT is introduced while explaining the basic notions and results of AIT to the reader who has an acquaintance with an elementary theory of computation.A simplification of the setting of AIT is the noiseless source coding in information theory. First, in the book, a statistical mechanical interpretation of the noiseless source coding scheme is introduced. It can be seen that the notions in statistical mechanics such as entropy, temperature, and thermal equilibrium are translated into the context of noiseless source coding in a natural manner. Then, the framework of AIT is introduced. On this basis, the introduction of a statistical mechanical interpretation of AIT is begun. Namely, the notion of thermodynamic quantities, such as free energy, energy, and entropy, is introduced into AIT. In the interpretation, the temperature is shown to be equal to the partial randomness of the values of all these thermodynamic quantities, where the notion of partial randomness is a stronger representation of the compression rate measured by means of program-size complexity. Additionally, it is demonstrated that this situation holds for the temperature itself as a thermodynamic quantity. That is, for each of all the thermodynamic quantities above, the computability of its value at temperature T gives a sufficient condition for T to be a fixed point on partial randomness.In this groundbreaking book, the current status of the interpretation from both mathematical and physical points of view is reported. For example, a total statistical mechanical interpretation of AIT that actualizes a perfect correspondence to normal statistical mechanics can be developed by identifying a microcanonical ensemble in the framework of AIT. As a result, the statistical mechanical meaning of the thermodynamic quantities of AIT is clarified. In the book, the close relationship of the interpretation to Landauer's principle is pointed out.
Statistical Mechanics of Lattice Systems: A Concrete Mathematical Introduction
by Sacha Friedli Yvan VelenikThis motivating textbook gives a friendly, rigorous introduction to fundamental concepts in equilibrium statistical mechanics, covering a selection of specific models, including the Curie–Weiss and Ising models, the Gaussian free field, O(n) models, and models with Kać interactions. Using classical concepts such as Gibbs measures, pressure, free energy, and entropy, the book exposes the main features of the classical description of large systems in equilibrium, in particular the central problem of phase transitions. It treats such important topics as the Peierls argument, the Dobrushin uniqueness, Mermin–Wagner and Lee–Yang theorems, and develops from scratch such workhorses as correlation inequalities, the cluster expansion, Pirogov–Sinai Theory, and reflection positivity. Written as a self-contained course for advanced undergraduate or beginning graduate students, the detailed explanations, large collection of exercises (with solutions), and appendix of mathematical results and concepts also make it a handy reference for researchers in related areas. Builds a narrative around the driving concepts, focusing on specific examples and models. Self-contained and accessible. Features numerous exercises and solutions, as well as a comprehensive appendix.
Statistical Mechanics of Neural Networks
by Haiping HuangThis book highlights a comprehensive introduction to the fundamental statistical mechanics underneath the inner workings of neural networks. The book discusses in details important concepts and techniques including the cavity method, the mean-field theory, replica techniques, the Nishimori condition, variational methods, the dynamical mean-field theory, unsupervised learning, associative memory models, perceptron models, the chaos theory of recurrent neural networks, and eigen-spectrums of neural networks, walking new learners through the theories and must-have skillsets to understand and use neural networks. The book focuses on quantitative frameworks of neural network models where the underlying mechanisms can be precisely isolated by physics of mathematical beauty and theoretical predictions. It is a good reference for students, researchers, and practitioners in the area of neural networks.
Statistical Methods and Analyses for Medical Devices
by Scott A. PardoThis book provides a reference for people working in the design, development, and manufacturing of medical devices. While there are no statistical methods specifically intended for medical devices, there are methods that are commonly applied to various problems in the design, manufacturing, and quality control of medical devices. The aim of this book is not to turn everyone working in the medical device industries into mathematical statisticians; rather, the goal is to provide some help in thinking statistically, and knowing where to go to answer some fundamental questions, such as justifying a method used to qualify/validate equipment, or what information is necessary to support the choice of sample sizes.While, there are no statistical methods specifically designed for analysis of medical device data, there are some methods that seem to appear regularly in relation to medical devices. For example, the assessment of receiver operating characteristic curves is fundamental to development of diagnostic tests, and accelerated life testing is often critical for assessing the shelf life of medical device products. Another example is sensitivity/specificity computations are necessary for in-vitro diagnostics, and Taguchi methods can be very useful for designing devices. Even notions of equivalence and noninferiority have different interpretations in the medical device field compared to pharmacokinetics. It contains topics such as dynamic modeling, machine learning methods, equivalence testing, and experimental design, for example.This book is for those with no statistical experience, as well as those with statistical knowledgeable—with the hope to provide some insight into what methods are likely to help provide rationale for choices relating to data gathering and analysis activities for medical devices.
Statistical Methods and Applications in Forestry and Environmental Sciences (Forum for Interdisciplinary Mathematics)
by Girish Chandra Raman Nautiyal Hukum ChandraThis book presents recent developments in statistical methodologies with particular relevance to applications in forestry and environmental sciences. It discusses important methodologies like ranked set sampling, adaptive cluster sampling, small area estimation, calibration approach-based estimators, design of experiments, multivariate techniques, Internet of Things, and ridge regression methods. It also covers the history of the implementation of statistical techniques in Indian forestry and the National Forest Inventory of India.The book is a valuable resource for applied statisticians, students, researchers, and practitioners in the forestry and environment sector. It includes real-world examples and case studies to help readers apply the techniques discussed. It also motivates academicians and researchers to use new technologies in the areas of forestry and environmental sciences with the help of software like R, MATLAB, Statistica, and Mathematica.
Statistical Methods for Astronomical Data Analysis
by Asis Kumar Chattopadhyay Tanuka ChattopadhyayThis book introduces "Astrostatistics" as a subject in its own right with rewarding examples, including work by the authors with galaxy and Gamma Ray Burst data to engage the reader. This includes a comprehensive blending of Astrophysics and Statistics. The first chapter's coverage of preliminary concepts and terminologies for astronomical phenomenon will appeal to both Statistics and Astrophysics readers as helpful context. Statistics concepts covered in the book provide a methodological framework. A unique feature is the inclusion of different possible sources of astronomical data, as well as software packages for converting the raw data into appropriate forms for data analysis. Readers can then use the appropriate statistical packages for their particular data analysis needs. The ideas of statistical inference discussed in the book help readers determine how to apply statistical tests. The authors cover different applications of statistical techniques already developed or specifically introduced for astronomical problems, including regression techniques, along with their usefulness for data set problems related to size and dimension. Analysis of missing data is an important part of the book because of its significance for work with astronomical data. Both existing and new techniques related to dimension reduction and clustering are illustrated through examples. There is detailed coverage of applications useful for classification, discrimination, data mining and time series analysis. Later chapters explain simulation techniques useful for the development of physical models where it is difficult or impossible to collect data. Finally, coverage of the many R programs for techniques discussed makes this book a fantastic practical reference. Readers may apply what they learn directly to their data sets in addition to the data sets included by the authors.
Statistical Methods for Recommender Systems
by Deepak K. Agarwal Bee-Chung ChenDesigning algorithms to recommend items such as news articles and movies to users is a challenging task in numerous web applications. The crux of the problem is to rank items based on users' responses to different items to optimize for multiple objectives. Major technical challenges are high dimensional prediction with sparse data and constructing high dimensional sequential designs to collect data for user modeling and system design. This comprehensive treatment of the statistical issues that arise in recommender systems includes detailed, in-depth discussions of current state-of-the-art methods such as adaptive sequential designs (multi-armed bandit methods), bilinear random-effects models (matrix factorization) and scalable model fitting using modern computing paradigms like MapReduce. The authors draw upon their vast experience working with such large-scale systems at Yahoo! and LinkedIn, and bridge the gap between theory and practice by illustrating complex concepts with examples from applications they are directly involved with.
Statistical Methods for Spatial Planning and Monitoring
by Silvestro Montrone Paola PerchinunnoThe book aims to investigate methods and techniques for spatial statistical analysis suitable to model spatial information in support of decision systems. Over the last few years there has been a considerable interest in these tools and in the role they can play in spatial planning and environmental modelling. One of the earliest and most famous definition of spatial planning was "a geographical expression to the economic, social, cultural and ecological policies of society": borrowing from this point of view, this text shows how an interdisciplinary approach is an effective way to an harmonious integration of national policies with regional and local analysis. A wide range of spatial models and techniques is, also, covered: spatial data mining, point processes analysis, nearest neighbor statistics and cluster detection, Fuzzy Regression model and local indicators of spatial association; all of these tools provide the policy-maker with a valuable support to policy development.
Statistical Methods for Spoken Dialogue Management
by Blaise ThomsonSpeech is the most natural mode of communication and yet attempts to build systems which support robust habitable conversations between a human and a machine have so far had only limited success. A key reason is that current systems treat speech input as equivalent to a keyboard or mouse, and behaviour is controlled by predefined scripts that try to anticipate what the user will say and act accordingly. But speech recognisers make many errors and humans are not predictable; the result is systems which are difficult to design and fragile in use. Statistical methods for spoken dialogue management takes a radically different view. It treats dialogue as the problem of inferring a user's intentions based on what is said. The dialogue is modelled as a probabilistic network and the input speech acts are observations that provide evidence for performing Bayesian inference. The result is a system which is much more robust to speech recognition errors and for which a dialogue strategy can be learned automatically using reinforcement learning. The thesis describes both the architecture, the algorithms needed for fast real-time inference over very large networks, model parameter estimation and policy optimisation. This ground-breaking work will be of interest both to practitioners in spoken dialogue systems and to cognitive scientists interested in models of human behaviour.
Statistical Methods in Computer Security (Statistics: A Series of Textbooks and Monographs)
by William W. S. ChenStatistical Methods in Computer Security summarizes discussions held at the recent Joint Statistical Meeting to provide a clear layout of current applications in the field. This blue-ribbon reference discusses the most influential advancements in computer security policy, firewalls, and security issues related to passwords. It addresses crime and m