Information and Complexity in Statistical Modeling

Information and Complexity in Statistical Modeling by Jorma Rissanen, published by Springer New York on November 23, 2010, is a softcover reprint of the hardcover first edition from 2007, comprising 142 pages in English. This book explores the concept that no statistical model can be definitively labeled as “true” or “false,” emphasizing the varying performance of models that can be assessed through probability frameworks. Rissanen presents a foundation for statistical modeling based on extracting information from data, formalizing key concepts such as complexity, learnable information, and noise.
Readers will find a comprehensive examination of modeling principles, particularly the Minimum Description Length (MDL) principle, which aids in encoding data efficiently. The text offers a unified approach to parameters and hypothesis testing, allowing for a logical assessment of confidence in test results. While basic knowledge of probability and statistics is necessary, a moderate level of mathematical proficiency is recommended. This edition serves as a valuable resource for graduate students and researchers in fields such as signal processing, bioinformatics, and machine learning, providing insights that encourage further exploration in modern engineering disciplines.
Official synopsis Publisher
No statistical model is “true” or “false,” “right” or “wrong”; the models just have varying performance, which can be assessed. The main theme in this book is to teach modeling based on the principle that the objective is to extract the information from data that can be learned with suggested classes of probability models. The intuitive and fundamental concepts of complexity, learnable information, and noise are formalized, which provides a firm information theoretic foundation for statistical modeling. Inspired by Kolmogorov’s structure function in the algorithmic theory of complexity, this is accomplished by finding the shortest code length, called the stochastic complexity, with which the data can be encoded when advantage is taken of the models in a suggested class, which amounts to the MDL (Minimum Description Length) principle. The complexity, in turn, breaks up into the shortest code length for the optimal model in a set of models that can be optimally distinguished from the given data and the rest, which defines “noise” as the incompressible part in the data without useful information.
Such a view of the modeling problem permits a unified treatment of any type of parameters, their number, and even their structure. Since only optimally distinguished models are worthy of testing, we get a logically sound and straightforward treatment of hypothesis testing, in which for the first time the confidence in the test result can be assessed. Although the prerequisites include only basic probability calculus and statistics, a moderate level of mathematical proficiency would be beneficial. The different and logically unassailable view of statistical modelling should provide excellent grounds for further research and suggest topics for graduate students in all fields of modern engineering, including and not restricted to signal and image processing, bioinformatics, pattern recognition, and machine learning to mention just a few.
Publisher
Topics
FAQ
What is “Information and Complexity in Statistical Modeling” about?
Who is the author of “Information and Complexity in Statistical Modeling”?
When was “Information and Complexity in Statistical Modeling” published?
What is the ISBN for “Information and Complexity in Statistical Modeling”?
What are the book details (language, pages, edition)?
