Basic properties of the risk of an estimator #
Main statements #
iSup_bayesRisk_le_minimaxRisk: the maximal Bayes risk is less than or equal to the minimax risk.bayesRisk_le_bayesRisk_comp: data-processing inequality for the Bayes risk with respect to a prior: if we compose the data generating kernelPwith a Markov kernel, then the Bayes risk increases.bayesRisk_le_iInf: forPa Markov kernel, the Bayes risk is less than⨅ y, ∫⁻ θ, ℓ θ y ∂π.
In several cases, there is no information in the data about the parameter and the Bayes risk takes its maximal value.
bayesRisk_const: if the data generating kernel is constant, then the Bayes risk is equal to⨅ y, ∫⁻ θ, ℓ θ y ∂π.bayesRisk_of_subsingleton: if the observation space is a subsingleton, then the Bayes risk is equal to⨅ y, ∫⁻ θ, ℓ θ y ∂π.
TODO #
In many cases, the maximal Bayes risk and the minimax risk are equal (by a so-called minimax theorem).
The maximal Bayes risk is less than or equal to the minimax risk.
See avgRisk_const_left' for a similar result with integrals swapped.
See avgRisk_const_left for a similar result with integrals swapped.
See avgRisk_const_right for a simpler result when P is a Markov kernel.
See avgRisk_const_right' for a similar result when P is not a Markov kernel.
See bayesRisk_le_iInf for a simpler result when P is a Markov kernel.
See bayesRisk_le_iInf' for a similar result when P is not a Markov kernel.
See avgRisk_le_mul for the usual case in which π is a probability measure and the kernels
are Markov.
For a bounded loss, the Bayes risk with respect to a prior is bounded by a constant.
See bayesRisk_le_mul for the usual cases where all measures are probability measures.
For a bounded loss, the Bayes risk with respect to a prior is bounded by a constant.
For a bounded loss, the Bayes risk with respect to a prior is finite.
Data processing inequality for the Bayes risk with respect to a prior: composition of the data generating kernel by a Markov kernel increases the risk.
Data processing inequality for the Bayes risk with respect to a prior: taking the map of the data generating kernel by a function increases the risk.