OPTIMIZATION OF TWO-ALTERNATIVE BATCH PROCESSING WITH PARAMETER ESTIMATION BASED ON DATA INSIDE BATCHES

A. V. Kolnogorov

Abstract


We consider optimization of two-alternative batch data processing within the framework of the Gaussian one-armed bandit problem. This means that there are two alternative processing methods with different efficiencies and the effectiveness of the second method is a priori unknown. It is necessary to determine which method is more effective and ensure its preferential use, so that the effectiveness of the second method is evaluated during the data processing inside batches. This approach is advisable to use if the volumes of batches and their number are not very large. Recursive equations for calculating Bayesian risk and regret in the usual and invariant form with a control horizon equal to one are obtained.


Keywords


Gaussian one-armed bandit; batch processing; Bayesian and minimax approaches; invariant description

References


Berry D.A., Fristedt B. Bandit Problems: Sequential Allocation of Experiments. London, New York, Chapman and Hall, 1985.

Presman E.L., Sonin I.M. Sequential Control with Incomplete Information. New York, Academic, 1990.

Tsetlin M.L. Automaton Theory and Modeling of Biological Systems. New

York, Academic, 1973.

Sragovich V.G. Mathematical Theory of Adaptive Control. Singapore,

World Sci., 2006.

Lattimore T., Szepesvari C. Bandit Algorithms. Cambridge, Cambridge University Press, 2020.

Kolnogorov A.V. One-Armed Bandit Problem for Parallel Data Processing Systems. Problems of Information Transmission, 2015, vol. 51, no. 2,

pp. 177--191. DOI: 10.1134/S0032946015020088

Vogel W. An Asymptotic Minimax Theorem for the Two-Armed Bandit

Problem. Ann. Math. Statist., 1960, vol. 31, no. 2, pp. 444--451.

Perchet V., Rigollet P., Chassang S., Snowberg E. Batched Bandit Problems. Ann. Statist., 2016, vol. 44, no. 2, pp. 660--681. DOI: 10.1214/15-AOS1381

Bradt R.N., Johnson S.M., Karlin S. On Sequential Designs for Maximizing the Sum of n Observations. Ann. Math. Statist., 1956,

vol. 27, pp. 1060--1074. DOI: 10.1214/aoms/1177728073

Chernoff H., Ray S.N. A Bayes Sequential Sampling Inspection Plan. Ann. Math. Statist., 1965, vol. 36, pp. 1387--1407. DOI: 10.1214/aoms/1177699898

Kolnogorov A. Gaussian One-Armed Bandit Problem. 2021 XVII International Symposium "Problems of Redundancy in Information and Control Systems" (REDUNDANCY). Moscow. IEEE, 2021, pp. 74--79.

DOI: 10.1109/REDUNDANCY52534.2021.9606464

Kolnogorov A.V. Gaussian One-Armed Bandit with Both Unknown Parameters. Siberian Electronic Mathematical Reports, 2022, vol. 19, no. 2, pp. 639--650.


Refbacks

  • There are currently no refbacks.


 Save