A PDE Approach to the Problems of Optimality of Expectations
Main Article Content
Abstract
Let (X, Z) be a bivariate random vector. A predictor of X based on Z is just a Borel function g(Z). The problem of "least squares prediction" of X given the observation Z is to find the global minimum point of the functional E[(X − g(Z))2] with respect to all random variables g(Z), where g is a Borel function. It is well known that the solution of this problem is the conditional expectation E(X|Z). We also know that, if for a nonnegative smooth function F: R×R → R, arg ming(Z)E[F(X, g(Z))] = E[X|Z], for all X and Z, then F(x, y) is a Bregmann loss function. It is also of interest, for a fixed ϕ to find F (x, y), satisfying, arg ming(Z)E[F(X, g(Z))] = ϕ(E[X|Z]), for all X and Z. In more general setting, a stronger problem is to find F (x, y) satisfying arg miny∈RE[F (X, y)] = ϕ(E[X]), ∀X. We study this problem and develop a partial differential equation (PDE) approach to solution of these problems.
Article Details
References
- K.B. Athreya, S.N. Lahiri, Measure Theory and Probability Theory, Springer Texts in Statistics, Springer, New York, 2006.
- A. Banerjee, X. Guo, H. Wang, On the Optimality of Conditional Expectation as a Bregman Predictor, IEEE Trans. Inform. Theory. 51 (2005), 2664–2669. https://doi.org/10.1109/tit.2005.850145.
- H.H. Bauschke, M.S. Macklem, J.B. Sewell, X. Wang, Klee Sets and Chebyshev Centers for the Right Bregman Distance, J. Approx. Theory. 162 (2010), 1225–1244. https://doi.org/10.1016/j.jat.2010.01.001.
- A. Ben-Tal, A. Charnes, M. Teboulle, Entropic Means, J. Math. Anal. Appl. 139 (1989), 537–551. https://doi.org/10.1016/0022-247x(89)90128-5.
- Y. Censor, S. Zenios, Parallel Optimization: Theory, Algorithms, and Applications, Oxford University Press, London, 1998.
- I. Csiszar, Why Least Squares and Maximum Entropy? An Axiomatic Approach to Inference for Linear Inverse Problems, Ann. Stat. 19 (1991), 2032–2066. https://doi.org/10.1214/aos/1176348385.
- I. Csiszar, Generalized Projections for Non-Negative Functions, in: Proceedings of 1995 IEEE International Symposium on Information Theory, IEEE, Whistler, BC, Canada, 1995: p. 6. https://doi.org/10.1109/ISIT.1995.531108.
- F. Deutsch, Best Approximation in Inner Product Spaces, Springer-Verlag, New York, 2021.
- G. Grimmett, D. Stirzaker, Probability and Random Processes, Oxford University Press, Oxford, 2004.
- S. Karlin, H.M. Taylor, A Second Course in Stochastic Processes, 2nd ed. Academic Press, San Diego, 1991.
- M. Hasanov, The Spectra of Two-Parameter Quadratic Operator Pencils, Math. Computer Model. 54 (2011), 742–755. https://doi.org/10.1016/j.mcm.2011.03.018.
- D. Reem, S. Reich, A. De Pierro, Re-Examination of Bregman Functions and New Properties of Their Divergences, Optimization. 68 (2018), 279–348. https://doi.org/10.1080/02331934.2018.1543295.
- D. Williams, Probability with Martingales, Cambridge Mathematical Textbooks, Cambridge University Press, Cambridge, 2001.