Mathematical statistics is understood as “a section of mathematics devoted to mathematical methods for collecting, systematizing, processing and interpreting statistical data, as well as using them for scientific or practical conclusions. The rules and procedures of mathematical statistics are based on the theory of probability, which makes it possible to evaluate the accuracy and reliability of the conclusions obtained in each problem on the basis of the available statistical material. At the same time, statistical data refers to information about the number of objects in any more or less extensive collection that have certain characteristics.

According to the type of problems being solved, mathematical statistics is usually divided into three sections: data description, estimation, and hypothesis testing.

According to the type of statistical data being processed, mathematical statistics is divided into four areas:

— one-dimensional statistics (statistics random variables), in which the observation result is described real number;

- multidimensional statistical analysis, where the result of observation over the object is described by several numbers (vector);

- statistics of random processes and time series, where the result of observation is a function;

— statistics of objects of non-numerical nature, in which the result of observation has a non-numerical nature, for example, is a set ( geometric figure), ordering or obtained as a result of measurement on a qualitative basis.

Historically, some areas of statistics of objects of non-numerical nature (in particular, problems of estimating the percentage of defective products and testing hypotheses about it) and one-dimensional statistics were the first to appear. Mathematical apparatus it is easier for them, so their example usually demonstrates the basic ideas of mathematical statistics.

Only those methods of data processing, ie. mathematical statistics are evidence-based, which are based on probabilistic models of relevant real phenomena and processes. It's about about models of consumer behavior, the occurrence of risks, the functioning of technological equipment, obtaining the results of an experiment, the course of a disease, etc. A probabilistic model of a real phenomenon should be considered built if the quantities under consideration and the relationships between them are expressed in terms of probability theory.

Correspondence to the probabilistic model of reality, i.e. its adequacy is substantiated, in particular, with the help of statistical methods for testing hypotheses.

Incredible data processing methods are exploratory, they can only be used in preliminary data analysis, since they do not make it possible to assess the accuracy and reliability of the conclusions obtained on the basis of limited statistical material.

Probabilistic and statistical methods are applicable wherever it is possible to construct and substantiate a probabilistic model of a phenomenon or process. Their use is mandatory when conclusions drawn from sample data are transferred to the entire population (for example, from a sample to an entire batch of products).

In specific areas of application, both probabilistic-statistical methods of wide application and specific ones are used. For example, in the section of production management devoted to statistical methods of product quality management, applied mathematical statistics (including the design of experiments) are used. With the help of its methods, a statistical analysis of the accuracy and stability of technological processes and a statistical assessment of quality are carried out. Specific methods include methods of statistical acceptance control of product quality, statistical regulation of technological processes, assessment and control of reliability, etc.

Such applied probabilistic-statistical disciplines as reliability theory and queuing theory are widely used. The content of the first of them is clear from the title, the second deals with the study of systems such as a telephone exchange, which receives calls at random times - the requirements of subscribers dialing numbers on their telephones. The duration of the service of these requirements, i.e. the duration of conversations is also modeled by random variables. A great contribution to the development of these disciplines was made by Corresponding Member of the USSR Academy of Sciences A.Ya. Khinchin (1894-1959), academician of the Academy of Sciences of the Ukrainian SSR B.V. Gnedenko (1912-1995) and other domestic scientists.

Every investigation in the field of random phenomena is always rooted in experiment, in experimental data. Numerical data that is collected when studying any feature of some object is called statistical. Statistical data are the initial material of the study. In order for them to be of scientific or practical value, they must be processed by methods of mathematical statistics.

Math statistics is a scientific discipline, the subject of which is the development of methods for recording, describing and analyzing statistical experimental data obtained as a result of observations of massive random phenomena.

The main tasks of mathematical statistics are:

    determination of the law of distribution of a random variable or a system of random variables;

    testing the plausibility of hypotheses;

    determination of unknown distribution parameters.

All methods of mathematical statistics are based on the theory of probability. However, due to the specificity of the problems being solved, mathematical statistics is separated from the theory of probability into an independent field. If in the theory of probability the model of the phenomenon is considered to be given and the possible real course of this phenomenon is calculated (Fig. 1), then in mathematical statistics an appropriate probabilistic model is selected based on statistical data (Fig. 2).

Fig.1. General problem of probability theory

Fig.2. General problem of mathematical statistics

As a scientific discipline, mathematical statistics developed along with the theory of probability. The mathematical apparatus of this science was built in the second half of the 19th century.

2. General population and sample.

To study statistical methods, the concepts of general and sample populations are introduced. In general, under general population is understood as a random variable X with the distribution function
. A sample set or a sample of volume n for a given random variable X is a set
independent observations of this quantity, where is called the sample value or implementation of the random variable X. In this way, can be considered as numbers (if the experiment is carried out and the sample was taken) and as random variables (before the experiment), since they vary from sample to sample.

Example 1. To determine the dependence of the thickness of a tree trunk on its height, 200 trees were selected. In this case, the sample size is n=200.

Example 2 As a result of sawing particle boards on a circular saw, 15 values ​​of the specific cutting work were obtained. In this case, n=15.

D
In order to confidently judge the feature of the general population that we are interested in according to the sample data, the objects of the sample must correctly represent it, that is, the sample must be representative(representative). The representativeness of the sample is usually achieved by random selection of objects: each object of the general population is provided with an equal probability of being included in the sample with all the others.

Fig.3. Demonstration of the representativeness of the sample

Math statistics is one of the main sections of such a science as mathematics, and is a branch that studies the methods and rules for processing certain data. In other words, it explores ways to uncover patterns that are inherent in large collections of identical objects, based on their sample survey.

A task this section consists in constructing methods for estimating the probability or making a certain decision about the nature of developing events, based on the results obtained. Tables, charts, and correlation fields are used to describe the data. rarely applied.

Mathematical statistics are used in various fields of science. For example, it is important for the economy to process information about homogeneous sets of phenomena and objects. They can be products manufactured by industry, personnel, profit data, etc. Depending on the mathematical nature of the results of observations, one can single out the statistics of numbers, the analysis of functions and objects of a non-numerical nature, and multidimensional analysis. In addition, they consider general and particular (related to the restoration of dependencies, the use of classifications, selective studies) tasks.

The authors of some textbooks believe that the theory of mathematical statistics is only a section of the theory of probability, while others believe that it is an independent science with its own goals, objectives and methods. However, in any case, its use is very extensive.

Thus, mathematical statistics is most clearly applicable in psychology. Its use will allow the specialist to correctly substantiate, find the relationship between the data, generalize them, avoid many logical errors, and much more. It should be noted that it is often simply impossible to measure this or that psychological phenomenon or personality trait without computational procedures. This suggests that the basics of this science are necessary. In other words, it can be called the source and basis of probability theory.

The method of research, which relies on the consideration of statistical data, is used in other areas. However, it should immediately be noted that its features, when applied to objects that have a different nature of origin, are always unique. Therefore, it does not make sense to combine physical science into one science. The general features of this method are reduced to counting a certain number of objects that are included in a particular group, as well as studying the distribution quantitative traits and the application of probability theory to obtain certain conclusions.

Elements of mathematical statistics are used in areas such as physics, astronomy, etc. Here, the values ​​of characteristics and parameters, hypotheses about the coincidence of any characteristics in two samples, about the symmetry of the distribution, and much more can be considered.

Mathematical statistics plays an important role in their implementation. Their goal is most often to build adequate methods for estimating and testing hypotheses. At present, computer technologies are of great importance in this science. They allow not only to significantly simplify the calculation process, but also to create samples for replication or when studying the suitability of the results obtained in practice.

In the general case, the methods of mathematical statistics help to draw two conclusions: either to make the desired judgment about the nature or properties of the data being studied and their relationships, or to prove that the results obtained are not enough to draw conclusions.

Math statistics is a modern branch of mathematics that deals with statistical description results of experiments and observations, as well as building mathematical models containing concepts probabilities. The theoretical basis of mathematical statistics is probability theory.

In the structure of mathematical statistics, two main sections are traditionally distinguished: descriptive statistics and statistical inference (Figure 1.1).

Rice. 1.1. Main sections of mathematical statistics

Descriptive statistics is used for:

o generalization of indicators of one variable (statistics of a random sample);

o identifying relationships between two or more variables (correlation-regression analysis).

Descriptive statistics makes it possible to obtain new information, quickly understand and comprehensively evaluate it, that is, it performs the scientific function of describing the objects of study, which justifies its name. The methods of descriptive statistics are designed to turn a set of individual empirical data into a system of forms and numbers that are visual for perception: frequency distributions; indicators of trends, variability, communication. These methods calculate the statistics of a random sample, which serve as the basis for the implementation of statistical inferences.

Statistical Inference give the opportunity:

o evaluate the accuracy, reliability and effectiveness of sample statistics, find errors that occur in the process of statistical research (statistical evaluation)

o summarize the parameters of the general population obtained on the basis of sample statistics (checking statistical hypotheses).

the main objective scientific research- this is the acquisition of new knowledge about a large class of phenomena, persons or events, which are commonly called the general population.

Population is the totality of objects of study, sample- its part, which is formed in a certain scientifically substantiated way 2.

The term "general population" is used when it comes to a large but finite set of objects under study. For example, about the totality of applicants in Ukraine in 2009 or the totality of children preschool age the city of Rivne. General populations can reach significant volumes, be finite and infinite. In practice, as a rule, one deals with finite sets. And if the ratio of the size of the general population to the size of the sample is more than 100, then, according to Glass and Stanley, the estimation methods for finite and infinite populations give essentially the same results. The general set can also be called the complete set of values ​​of some attribute. The fact that the sample belongs to the general population is the main basis for assessing the characteristics of the general population according to the characteristics of the sample.

Main idea mathematical statistics is based on the belief that a complete study of all objects of the general population in most scientific problems is either practically impossible or economically impractical, since it requires a lot of time and significant material costs. Therefore, in mathematical statistics, it is used selective approach, the principle of which is shown in the diagram in Fig. 1.2.

For example, according to the formation technology, the samples are randomized (simple and systematic), stratified, clustered (see Section 4).

Rice. 1.2. Scheme of application of methods of mathematical statistics According to selective approach the use of mathematical and statistical methods can be carried out in the following sequence (see Fig. 1.2):

o with general population, properties of which are subject to research, certain methods form a sample- a typical but limited number of objects to which research methods are applied;

o as a result of observational methods, experimental actions and measurements on sample objects, empirical data are obtained;

o processing of empirical data using descriptive statistics methods gives sample indicators, which are called statisticians - like the name of the discipline, by the way;

o applying statistical inference methods to statistician, receive parameters that characterize the properties the general population.

Example 1.1. In order to assess the stability of the level of knowledge (variable x) testing of a randomized sample of 3 students with a volume of n. The tests contained m tasks, each of which was evaluated according to the scoring system: "completed" "- 1," not fulfilled "- 0. average current achievements of students remained X

3 randomized sample(from the English. Random - random) is a representative sample, which is formed according to the strategy of random tests.

at the level of previous years / h? Solution sequence:

o find out a meaningful hypothesis of the type: "if the current test results do not differ from the past, then we can consider the level of students' knowledge to be unchanged, and studying proccess- stable";

o formulate an adequate statistical hypothesis, such as the null hypothesis H 0 that the "current GPA X is not statistically different from the average of previous years / h", i.e. H 0: X = ⁄ r, against the corresponding alternative hypothesis X Ф ^ ;

o build empirical distributions of the investigated variable X;

o define(if necessary) correlations, for example, between a variable X and other indicators, build regression lines;

o check compliance empirical distribution normal law;

o evaluate the value of point indicators and the confidence interval of parameters, for example, the average;

o define criteria for testing statistical hypotheses;

o test statistical hypotheses based on the selected criteria;

o formulate a decision on the statistical null hypothesis on a certain significance level;

o move from the decision to accept or reject the statistical null hypothesis of the interpretation of the conclusions regarding the meaningful hypothesis;

o formulate meaningful conclusions.

So, if we summarize the above procedures, the application of statistical methods consists of three main blocks:

The transition from an object of reality to an abstract mathematical and statistical scheme, that is, the construction of a probabilistic model of a phenomenon, process, property;

Carrying out computational actions by proper mathematical means within the framework of a probabilistic model based on the results of measurements, observations, experiments and the formulation of statistical conclusions;

Interpretation of statistical conclusions about the real situation and making an appropriate decision.

Statistical methods for processing and interpreting data are based on probability theory. The theory of probability is the basis of the methods of mathematical statistics. Without the use of fundamental concepts and laws of probability theory, it is impossible to generalize the conclusions of mathematical statistics, and hence their reasonable use for scientific and practical purposes.

Thus, the task of descriptive statistics is to transform a set of sample data into a system of indicators - statistics - frequency distributions, measures of central tendency and variability, coupling coefficients, and the like. However, statistics are characteristics, in fact, of a particular sample. Of course, it is possible to calculate sample distributions, sample means, variances, etc., but such "data analysis" is of limited scientific and educational value. The "mechanical" transfer of any conclusions drawn on the basis of such indicators to other populations is not correct.

In order to be able to transfer sample indicators or others, or to more common populations, it is necessary to have mathematically justified provisions on the conformity and ability of sample characteristics with the characteristics of these common so-called populations. Such provisions are based on theoretical approaches and schemes associated with probabilistic models of reality, for example, on the axiomatic approach, in the law big numbers etc. Only with their help it is possible to transfer the properties that are established by the results of the analysis of limited empirical information, either to other or to widespread sets. Thus, the construction, the laws of functioning, the use of probabilistic models, is the subject mathematical field called "probability theory", becomes the essence of statistical methods.

Thus, in mathematical statistics, two parallel lines of indicators are used: the first line, which is relevant to practice (these are sample indicators) and the second, based on theory (these are indicators of a probabilistic model). For example, the empirical frequencies that are determined on the sample correspond to the concepts of theoretical probability; sample mean (practice) corresponds expected value(theory), etc. Moreover, in studies, selective characteristics, as a rule, are primary. They are calculated on the basis of observations, measurements, experiments, after which they undergo a statistical assessment of the ability and effectiveness, testing of statistical hypotheses in accordance with the objectives of the research, and in the end are accepted with a certain probability as indicators of the properties of the studied populations.

Question. A task.

1. Describe the main sections of mathematical statistics.

2. What is the main idea of ​​mathematical statistics?

3. Describe the ratio of the general and sample populations.

4. Explain the scheme for applying the methods of mathematical statistics.

5. Specify the list of the main tasks of mathematical statistics.

6. What are the main blocks of the application of statistical methods? Describe them.

7. Expand the connection between mathematical statistics and probability theory.

How are probability and mathematical statistics used? These disciplines are the basis of probabilistic-statistical methods decision making. To use their mathematical apparatus, you need tasks decision making express in terms of probabilistic-statistical models. Application of a specific probabilistic statistical method decision making consists of three stages:

  • transition from economic, managerial, technological reality to an abstract mathematical and statistical scheme, i.e. building a probabilistic model of a control system, a technological process, decision-making procedures, in particular according to the results of statistical control, etc.;
  • carrying out calculations and obtaining conclusions by purely mathematical means within the framework of a probabilistic model;
  • interpretation of mathematical and statistical conclusions in relation to a real situation and making an appropriate decision (for example, on the conformity or non-compliance of product quality with established requirements, the need to adjust the technological process, etc.), in particular, conclusions (on the proportion of defective units of products in a batch, on specific form of distribution laws controlled parameters technological process, etc.).

Mathematical statistics uses the concepts, methods and results of probability theory. Consider the main issues of building probabilistic models decision making in economic, managerial, technological and other situations. For the active and correct use of normative-technical and instructive-methodical documents on probabilistic-statistical methods decision making prior knowledge is required. So, it is necessary to know under what conditions one or another document should be applied, what initial information is necessary to have for its selection and application, what decisions should be made based on the results of data processing, etc.

Examples of application of probability theory and mathematical statistics. Let us consider several examples when probabilistic-statistical models are a good tool for solving managerial, industrial, economic, and national economic problems. So, for example, in the novel by A.N. Tolstoy's "Walking through the torments" (vol. 1) says: "the workshop gives twenty-three percent of the marriage, you hold on to this figure," Strukov said to Ivan Ilyich.

The question arises how to understand these words in the conversation of factory managers, since one unit of production cannot be defective by 23%. It can be either good or defective. Perhaps Strukov meant that a large batch contains approximately 23% of defective units. Then the question arises, what does "about" mean? Let 30 out of 100 tested units of products turn out to be defective, or out of 1000-300, or out of 100000-30000, etc., should Strukov be accused of lying?

Or another example. The coin that is used as a lot must be "symmetrical", i.e. when it is thrown, on average, in half the cases, the coat of arms should fall out, and in half the cases - the lattice (tails, number). But what does "average" mean? If you spend many series of 10 throws in each series, then there will often be series in which a coin drops out 4 times with a coat of arms. For a symmetrical coin, this will happen in 20.5% of the series. And if there are 40,000 coats of arms for 100,000 tosses, can the coin be considered symmetrical? Procedure decision making is based on the theory of probability and mathematical statistics.

The example under consideration may not seem serious enough. However, it is not. The draw is widely used in organizing industrial feasibility experiments, for example, when processing the results of measuring the quality index (friction moment) of bearings depending on various technological factors (the influence of a conservation environment, methods of preparing bearings before measurement, the effect of bearing load in the measurement process, etc.). P.). Suppose it is necessary to compare the quality of bearings depending on the results of their storage in different conservation oils, i.e. in composition oils and . When planning such an experiment, the question arises which bearings should be placed in the composition oil, and which - in the composition oil, but in such a way as to avoid subjectivity and ensure the objectivity of the decision.

The answer to this question can be obtained by drawing lots. A similar example can be given with the quality control of any product. Sampling is done to decide whether or not an inspected lot of products meets the specified requirements. Based on the results of the sample control, a conclusion is made about the entire batch. In this case, it is very important to avoid subjectivity in the formation of the sample, i.e. it is necessary that each unit of product in the controlled lot has the same probability of being selected in the sample. Under production conditions, the selection of units of production in the sample is usually carried out not by lot, but by special tables of random numbers or with the help of computer random number generators.

Similar problems of ensuring the objectivity of comparison arise when comparing different schemes. production organization, remuneration, during tenders and competitions, selection of candidates for vacant positions, etc. Everywhere you need a lottery or similar procedures. Let us explain by the example of identifying the strongest and second strongest teams when organizing a tournament according to the Olympic system (the loser is eliminated). Let the stronger team always win over the weaker one. It is clear that the strongest team will definitely become the champion. The second strongest team will reach the final if and only if it has no games with the future champion before the final. If such a game is planned, then the second strongest team will not reach the final. The one who plans the tournament can either "knock out" the second strongest team from the tournament ahead of time, bringing it down in the first meeting with the leader, or provide it with second place, ensuring meetings with weaker teams until the final. To avoid subjectivity, draw lots. For an 8-team tournament, the probability that the two strongest teams will meet in the final is 4/7. Accordingly, with a probability of 3/7, the second strongest team will leave the tournament ahead of schedule.

In any measurement of product units (using a caliper, micrometer, ammeter, etc.), there are errors. To find out if there are systematic errors, it is necessary to make repeated measurements of a unit of production, the characteristics of which are known (for example, a standard sample). It should be remembered that in addition to the systematic error, there is also a random error.

Therefore, the question arises of how to find out from the measurement results whether there is a systematic error. If we note only whether the error obtained during the next measurement is positive or negative, then this problem can be reduced to the previous one. Indeed, let's compare the measurement with the throwing of a coin, the positive error - with the loss of the coat of arms, the negative - with the lattice (zero error with a sufficient number of divisions of the scale almost never occurs). Then checking the absence of a systematic error is equivalent to checking the symmetry of the coin.

The purpose of these considerations is to reduce the problem of checking the absence of a systematic error to the problem of checking the symmetry of a coin. The above reasoning leads to the so-called "criterion of signs" in mathematical statistics.

In statistical regulation of technological processes, based on the methods of mathematical statistics, rules and plans for statistical control of processes are developed, aimed at timely detection of the disorder of technological processes, taking measures to adjust them and prevent the release of products that do not meet the established requirements. These measures are aimed at reducing production costs and losses from the supply of low-quality products. With statistical acceptance control, based on the methods of mathematical statistics, quality control plans are developed by analyzing samples from product batches. The difficulty lies in being able to correctly build probabilistic-statistical models decision making on the basis of which the above questions can be answered. In mathematical statistics, probabilistic models and methods for testing hypotheses have been developed for this, in particular, hypotheses that the proportion of defective units of production is equal to a certain number, for example, (remember the words of Strukov from the novel by A.N. Tolstoy).

Assessment tasks. In a number of managerial, industrial, economic, national economic situations, problems of a different type arise - problems of estimating the characteristics and parameters of probability distributions.

Consider an example. Let a batch of N electric lamps come to the control. A sample of n electric lamps was randomly selected from this batch. A number of natural questions arise. How can the average service life of electric lamps be determined from the results of testing the sample elements and with what accuracy can this characteristic be estimated? How will the accuracy change if a larger sample is taken? At what number of hours can it be guaranteed that at least 90% of the electric lamps will last more than hours?

Suppose that when testing a sample with a volume of electric lamps, electric lamps turned out to be defective. Then the following questions arise. What limits can be specified for the number of defective electric lamps in a batch, for the level of defectiveness, etc.?

Or, in a statistical analysis of the accuracy and stability of technological processes, it is necessary to evaluate such quality indicators, as an average controlled parameter and the degree of its spread in the process under consideration. According to the theory of probability, it is advisable to use its mathematical expectation as the average value of a random variable, and the variance, standard deviation, or the coefficient of variation. This raises the question: how to evaluate these statistical characteristics according to sample data and with what accuracy can this be done? There are many similar examples. Here it was important to show how probability theory and mathematical statistics can be used in production management when making decisions in the field of statistical product quality management.

What is "mathematical statistics"? Mathematical statistics is understood as "a section of mathematics devoted to the mathematical methods of collecting, systematizing, processing and interpreting statistical data, as well as using them for scientific or practical conclusions. The rules and procedures of mathematical statistics are based on the theory of probability, which makes it possible to evaluate the accuracy and reliability of the conclusions obtained in each task based on the available statistical material" [ [ 2.2], p. 326]. At the same time, statistical data refers to information about the number of objects in any more or less extensive collection that have certain characteristics.

According to the type of problems being solved, mathematical statistics is usually divided into three sections: data description, estimation, and hypothesis testing.

According to the type of statistical data being processed, mathematical statistics is divided into four areas:

  • one-dimensional statistics (statistics of random variables), in which the result of an observation is described by a real number;
  • multidimensional statistical analysis, where the result of observation of an object is described by several numbers (vector);
  • statistics of random processes and time series, where the result of observation is a function;
  • statistics of objects of a non-numerical nature, in which the result of an observation is of a non-numerical nature, for example, it is a set (a geometric figure), an ordering, or obtained as a result of a measurement by a qualitative attribute.

Historically, some areas of statistics of non-numerical objects (in particular, problems of estimating the percentage of marriage and testing hypotheses about it) and one-dimensional statistics were the first to appear. The mathematical apparatus is simpler for them, therefore, by their example, they usually demonstrate the main ideas of mathematical statistics.

Only those methods of data processing, ie. mathematical statistics are evidence-based, which are based on probabilistic models of relevant real phenomena and processes. We are talking about models of consumer behavior, the occurrence of risks, the functioning of technological equipment, obtaining the results of an experiment, the course of a disease, etc. A probabilistic model of a real phenomenon should be considered built if the quantities under consideration and the relationships between them are expressed in terms of probability theory. Correspondence to the probabilistic model of reality, i.e. its adequacy is substantiated, in particular, using statistical methods for testing hypotheses.

Incredible data processing methods are exploratory, they can only be used in preliminary data analysis, since they do not make it possible to assess the accuracy and reliability of the conclusions obtained on the basis of limited statistical material.

Probabilistic and statistical methods are applicable wherever it is possible to construct and substantiate a probabilistic model of a phenomenon or process. Their use is mandatory when conclusions drawn from sample data are transferred to the entire population (for example, from a sample to an entire batch of products).

In specific applications, they are used as probabilistic statistical methods wide application, as well as specific ones. For example, in the section of production management devoted to statistical methods of product quality management, applied mathematical statistics (including the design of experiments) are used. With the help of its methods, statistical analysis accuracy and stability of technological processes and statistical quality assessment. Specific methods include methods of statistical acceptance control of product quality, statistical regulation of technological processes, assessment and control of reliability, etc.

Such applied probabilistic-statistical disciplines as reliability theory and queuing theory are widely used. The content of the first of them is clear from the title, the second deals with the study of systems such as a telephone exchange, which receives calls at random times - the requirements of subscribers dialing numbers on their telephones. The duration of the service of these requirements, i.e. the duration of conversations is also modeled by random variables. A great contribution to the development of these disciplines was made by Corresponding Member of the USSR Academy of Sciences A.Ya. Khinchin (1894-1959), Academician of the Academy of Sciences of the Ukrainian SSR B.V. Gnedenko (1912-1995) and other domestic scientists.

Briefly about the history of mathematical statistics. Mathematical statistics as a science begins with the works of the famous German mathematician Carl Friedrich Gauss (1777-1855), who, based on the theory of probability, investigated and substantiated least square method, created by him in 1795 and used to process astronomical data (in order to refine the orbit of the minor planet Ceres). One of the most popular probability distributions, the normal one, is often named after him, and in the theory of random processes, the main object of study is Gaussian processes.

AT late XIX in. - the beginning of the twentieth century. a major contribution to mathematical statistics was made by English researchers, primarily K. Pearson (1857-1936) and R.A. Fisher (1890-1962). In particular, Pearson developed the "chi-square" criterion for testing statistical hypotheses, and Fisher - analysis of variance, the theory of experiment planning, the maximum likelihood method of parameter estimation.

In the 30s of the twentieth century. Pole Jerzy Neumann (1894-1977) and Englishman E. Pearson developed a general theory of testing statistical hypotheses, and Soviet mathematicians Academician A.N. Kolmogorov (1903-1987) and Corresponding Member of the USSR Academy of Sciences N.V. Smirnov (1900-1966) laid the foundations of non-parametric statistics. In the forties of the twentieth century. Romanian A. Wald (1902-1950) built the theory of consistent statistical analysis.

Mathematical statistics is rapidly developing at the present time. So, over the past 40 years, four fundamentally new areas of research can be distinguished [ [ 2.16 ] ]:

  • development and implementation mathematical methods planning experiments;
  • development of statistics of objects of non-numerical nature as an independent direction in applied mathematical statistics;
  • development of statistical methods resistant to small deviations from the used probabilistic model;
  • wide deployment of work on the creation of computer software packages designed for statistical data analysis.

Probabilistic-statistical methods and optimization. The idea of ​​optimization permeates modern applied mathematical statistics and other statistical methods. Namely, methods of planning experiments, statistical acceptance control, statistical control of technological processes, etc. On the other hand, optimization formulations in theory decision making, for example, the applied theory of optimizing product quality and the requirements of standards, provide for the widespread use of probabilistic-statistical methods, primarily applied mathematical statistics.

In production management, in particular, when optimizing product quality and standard requirements, it is especially important to apply statistical methods at the initial stage life cycle products, i.e. at the stage of research preparation of experimental design developments (development of promising requirements for products, preliminary design, terms of reference for experimental design development). This is due to the limited information available at the initial stage of the product life cycle and the need to predict the technical possibilities and economic situation for the future. Statistical Methods should be applied at all stages of solving the optimization problem - when scaling variables, developing mathematical models for the functioning of products and systems, conducting technical and economic experiments, etc.

In optimization problems, including optimization of product quality and standard requirements, all areas of statistics are used. Namely - the statistics of random variables, multivariate statistical analysis, statistics of random processes and time series, statistics of objects of non-numerical nature. The choice of a statistical method for the analysis of specific data should be carried out according to the recommendations [