Depth Functions for Partial Orders with a Descriptive Analysis of Machine Learning Algorithms
Abstract: We propose a framework for descriptively analyzing sets of partial orders based on the concept of depth functions. Despite intensive studies of depth functions in linear and metric spaces, there is very little discussion on depth functions for non-standard data types such as partial orders. We introduce an adaptation of the well-known simplicial depth to the set of all partial orders, the union-free generic (ufg) depth. Moreover, we utilize our ufg depth for a comparison of machine learning algorithms based on multidimensional performance measures. Concretely, we analyze the distribution of different classifier performances over a sample of standard benchmark data sets. Our results promisingly demonstrate that our approach differs substantially from existing benchmarking approaches and, therefore, adds a new perspective to the vivid debate on the comparison of classifiers.
- W. Armstrong. Dependency structures of data base relationships. International Federation for Information Processing Congress, 74:580–583, 1974.
- Mining minimal non-redundant association rules using frequent closed itemsets. In J. Lloyd, V. Dahl, U. Furbach, M. Kerber, K. Lau, C. Palamidessi, L. Pereira, Y. Sagiv, and P. Stuckey, editors, Computational Logic — CL 2000, pages 972–986. Springer, 2000.
- Should we really use post-hoc tests based on mean-ranks? The Journal of Machine Learning Research, 17(1):152–161, 2016.
- Lattices, closures systems and implication bases: A survey of structural aspects and algorithms. Theoretical Computer Science, 743:93–109, 2018.
- Statistical models for partial orders based on data depth and formal concept analysis. In D. Ciucci, I. Couso, J. Medina, D. Slezak, D. Petturiti, B. Bouchon-Meunier, and R. Yager, editors, Information Processing and Management of Uncertainty in Knowledge-Based Systems, pages 17–30. Springer, 2022.
- R. Bradley and M. Terry. Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika, 39(3/4):324–345, 1952.
- A stochastic dominance approach to financial risk management strategies. Journal of Econometrics, 187(2):472–485, 2015.
- L. Chang. Partial order relations for classification comparisons. Canadian Journal of Statistics, 48(2):152–166, 2020.
- I. Couso and D. Dubois. Statistical reasoning with set-valued information: Ontic vs. epistemic views. International Journal of Approximate Reasoning, 55(7):1502–1518, 2014.
- J. Demšar. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research, 7:1–30, 2006.
- J. Eckhoff. Helly, Radon, and Carathéodory type theorems. In Handbook of Convex Geometry, pages 389–448. Elsevier, 1993.
- Domain-based benchmark experiments: Exploratory and inferential analysis. Austrian Journal of Statistics, 41(1):5–26, 2012.
- M. Fligner and J. Verducci. Distance based ranking models. Journal of the Royal Statistical Society. Series B (Methodological), 48(3):359–369, 1986.
- Package ‘glmnet’. CRAN R Repositary, 2021.
- B. Ganter and R. Wille. Formal Concept Analysis: Mathematical Foundations. Springer, 2012.
- K. Hechenbichler and K. Schliep. Weighted k-nearest-neighbor techniques and ordinal classification. Technical Report, LMU, 2004. URL http://nbn-resolving.de/urn/resolver.pl?urn=nbn:de:bvb:19-epub-1769-9.
- The design and analysis of benchmark experiments. Journal of Computational and Graphical Statistics, 14(3):675–699, 2005.
- Concepts for decision making under severe uncertainty with partial ordinal and partial cardinal preferences. International Journal of Approximate Reasoning, 98:112–131, 2018a.
- A probabilistic evaluation framework for preference aggregation reflecting group homogeneity. Mathematical Social Sciences, 96:49–62, 2018b.
- Information efficient learning of complexly structured preferences: Elicitation procedures and their application to decision making under uncertainty. International Journal of Approximate Reasoning, 144:69–91, 2022a.
- Statistical comparisons of classifiers by generalized stochastic dominance. Arxiv Preprint, 2022b. URL https://arxiv.org/abs/2209.01857.
- Sequential decision making with partially ordered preferences. Artificial Intelligence, 175:1346 – 1365, 2011.
- G. Lebanon and Y. Mao. Non-parametric modeling of partially ranked data. In J. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems, volume 20. Curran Associates, Inc., 2007.
- H. Levy and A. Levy. Ordering uncertain options under inflation: A note. The Journal of Finance, 39(4):1223–1229, 1984.
- R. Liu. On a notion of data depth based on random simplices. The Annals of Statistics, 18:405–414, 1990.
- S. López-Pintado and J. Romo. On the concept of depth for functional data. Journal of the American statistical Association, 104(486):718–734, 2009.
- Credal sum-product networks. In A. Antonucci, G. Corani, I. Couso, and S. Destercke, editors, International Symposium on Imprecise Probability: Theories and Applications, volume 62, 10–14 Jul 2017.
- K. Mosler. Multivariate Dispersion, Central Regions, and Depth: The Lift Zonoid Approach. Springer, 2002.
- K. Mosler and P. Mozharovskyi. Choosing among notions of multivariate depth statistics. Statistical Science, 37:348–368, 2022.
- Learning partially ranked data based on graph regularization. arXiv preprint arXiv:1902.10963, 2019.
- Incompleteness and incomparability in preference aggregation: Complexity results. Artificial Intelligence, 175(7):1272–1289, 2011.
- Statistical modelling under epistemic data imprecision: some results on estimating multinomial distributions and logistic regression for coarse categorical data. In T. Augustin, S. Doria, E. Miranda, and E. Quaeghebeur, editors, ISIPTA ’15, Proceedings of the Ninth International Symposium on Imprecise Probability: Theories and Applications, 2015a.
- Statistical modelling in surveys without neglecting the undecided: Multinomial logistic regression models and imprecise classification trees under ontic data imprecision. In T. Augustin, S. Doria, E. Miranda, and E. Quaeghebeur, editors, ISIPTA ’15, Proceedings of the Ninth International Symposium on Imprecise Probability: Theories and Applications, 2015b.
- G. Schollmeyer. Lower quantiles for complete lattices. Technical Report, LMU, 2017a. URL http://nbn-resolving.de/urn/resolver.pl?urn=nbn:de:bvb:19-epub-40448-7.
- G. Schollmeyer. Application of lower quantiles for complete lattices to ranking data: Analyzing outlyingness of preference orderings. Technical Report, LMU, 2017b. URL http://nbn-resolving.de/urn/resolver.pl?urn=nbn:de:bvb:19-epub-40452-9.
- G. Schollmeyer and H. Blocher. A note on the connectedness property of union-free generic sets of partial orders. Arxiv Preprint, 2023. URL https://www.foundstat.statistik.uni-muenchen.de/personen/mitglieder/blocher/index.html.
- Detecting stochastic dominance for poset-valued random variables as an example of linear programming on closure systems. Technical Report, LMU, 2017. URL http://nbn-resolving.de/urn/resolver.pl?urn=nbn:de:bvb:19-epub-40416-0.
- A representation of partially ordered preferences. Annals of Statistics, 23:2168–2217, 1995.
- J. Stoye. Statistical inference for interval identified parameters. In T. Augustin, F. Coolen, S. Moral, and M. Troffaes, editors, ISIPTA ’09, Proceedings of the Sixth International Symposium on Imprecise Probabilities: Theories and Applications, 2009.
- Package ‘rpart’, 2015. URL http://cran.ma.ic.ac.uk/web/packages/rpart/rpart.pdf. [Accessed: 15.02.2023].
- J. Tukey. Mathematics and the picturing of data. In R. James, editor, Proceedings of the International Congress of Mathematicians Vancouver, pages 523–531, Vancouver, 1975. Mathematics-Congresses.
- Openml: Networked science in machine learning. SIGKDD Explorations, 15(2):49–60, 2013.
- V. Vapnik and A. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. In V. Vovk, H. Papadopoulos, and A. Gammerman, editors, Measures of Complexity: Festschrift for Alexey Chervonenkis, pages 11–30. Springer, 2015.
- M. Wright and A. Ziegler. ranger: A fast implementation of random forests for high dimensional data in C++ and R. Journal of Statistical Software, 77(1):1–17, 2017.
- M. Zaffalon. The naive credal classifier. Journal of Statistical Planning and Inference, 105(1):5–21, 2002.
- Evaluating credal classifiers by utility-discounted predictive accuracy. International Journal of Approximate Reasoning, 53(8):1282–1301, 2012.
- Y. Zuo and R. Serfling. General notions of statistical depth function. The Annals of Statistics, 28(2):461 – 482, 2000.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.