OPTIMIZATION OF TWO-ALTERNATIVE BATCH PROCESSING WITH PARAMETER ESTIMATION BASED ON DATA INSIDE BATCHES
Abstract
We consider optimization of two-alternative batch data processing within the framework of the Gaussian one-armed bandit problem. This means that there are two alternative processing methods with different efficiencies and the effectiveness of the second method is a priori unknown. It is necessary to determine which method is more effective and ensure its preferential use, so that the effectiveness of the second method is evaluated during the data processing inside batches. This approach is advisable to use if the volumes of batches and their number are not very large. Recursive equations for calculating Bayesian risk and regret in the usual and invariant form with a control horizon equal to one are obtained.
Keywords
Full Text:
PDF (Русский)References
Berry D.A., Fristedt B. Bandit Problems: Sequential Allocation of Experiments. London, New York, Chapman and Hall, 1985.
Presman E.L., Sonin I.M. Sequential Control with Incomplete Information. New York, Academic, 1990.
Tsetlin M.L. Automaton Theory and Modeling of Biological Systems. New
York, Academic, 1973.
Sragovich V.G. Mathematical Theory of Adaptive Control. Singapore,
World Sci., 2006.
Lattimore T., Szepesvari C. Bandit Algorithms. Cambridge, Cambridge University Press, 2020.
Kolnogorov A.V. One-Armed Bandit Problem for Parallel Data Processing Systems. Problems of Information Transmission, 2015, vol. 51, no. 2,
pp. 177--191. DOI: 10.1134/S0032946015020088
Vogel W. An Asymptotic Minimax Theorem for the Two-Armed Bandit
Problem. Ann. Math. Statist., 1960, vol. 31, no. 2, pp. 444--451.
Perchet V., Rigollet P., Chassang S., Snowberg E. Batched Bandit Problems. Ann. Statist., 2016, vol. 44, no. 2, pp. 660--681. DOI: 10.1214/15-AOS1381
Bradt R.N., Johnson S.M., Karlin S. On Sequential Designs for Maximizing the Sum of n Observations. Ann. Math. Statist., 1956,
vol. 27, pp. 1060--1074. DOI: 10.1214/aoms/1177728073
Chernoff H., Ray S.N. A Bayes Sequential Sampling Inspection Plan. Ann. Math. Statist., 1965, vol. 36, pp. 1387--1407. DOI: 10.1214/aoms/1177699898
Kolnogorov A. Gaussian One-Armed Bandit Problem. 2021 XVII International Symposium "Problems of Redundancy in Information and Control Systems" (REDUNDANCY). Moscow. IEEE, 2021, pp. 74--79.
DOI: 10.1109/REDUNDANCY52534.2021.9606464
Kolnogorov A.V. Gaussian One-Armed Bandit with Both Unknown Parameters. Siberian Electronic Mathematical Reports, 2022, vol. 19, no. 2, pp. 639--650.
Refbacks
- There are currently no refbacks.