Why and how to construct an epistemic justification of machine learning?

Špelda, Petr; Střítecký, Vít

doi:10.1007/s11229-024-04702-z

Why and how to construct an epistemic justification of machine learning?

dc.contributor.author	Špelda, Petr
dc.contributor.author	Střítecký, Vít
dc.date.accessioned	2024-09-09T15:15:28Z
dc.date.available	2024-09-09T15:15:28Z
dc.date.issued	2024
dc.identifier.uri	https://hdl.handle.net/20.500.14178/2604
dc.description.abstract	Consider a set of shuffled observations drawn from a fixed probability distribution over some instance domain. What enables learning of inductive generalizations which proceed from such a set of observations? The scenario is worthwhile because it epistemically characterizes most of machine learning. This kind of learning from observations is also inverse and ill-posed. What reduces the non-uniqueness of its result and, thus, its problematic epistemic justification, which stems from a one-to-many relation between the observations and many learnable generalizations? The paper argues that this role belongs to any complexity regularization which satisfies Norton's Material Theory of Induction (MTI) by localizing the inductive risk to facts in the given domain. A prime example of the localization is the Lottery Ticket Hypothesis (LTH) about overparameterized neural networks. The explanation of MTI's role in complexity regularization of neural networks is provided by analyzing the stability of Empirical Risk Minimization (ERM), an inductive rule that controls the learning process and leads to an inductive generalization on the given set of observations. In cases where ERM might become asymptotically unstable, making the justification of the generalization by uniform convergence unavailable, LTH and MTI can be used to define a local stability. A priori, overparameterized neural networks are such cases and the combination of LTH and MTI can block ERM's trivialization caused by equalizing the strengths of its inductive support for risk minimization. We bring closer the investigation of generalization in artificial neural networks and the study of inductive inference and show the division of labor between MTI and the optimality justifications (developed by Gerhard Schurz) in machine learning.	en
dc.language.iso	en
dc.relation.url	https://doi.org/10.1007/s11229-024-04702-z
dc.rights	Creative Commons Uveďte původ 4.0 International	cs
dc.rights	Creative Commons Attribution 4.0 International	en
dc.title	Why and how to construct an epistemic justification of machine learning?	en
dcterms.accessRights	embargoedAccess
dcterms.license	https://creativecommons.org/licenses/by/4.0/legalcode
dc.date.updated	2024-09-09T15:15:28Z
dc.subject.keyword	Lottery ticket hypothesis	en
dc.subject.keyword	Complexity regularization	en
dc.subject.keyword	Material theory of induction	en
dc.subject.keyword	Empirical risk minimization	en
dc.identifier.eissn	1573-0964
dc.relation.fundingReference	info:eu-repo/grantAgreement/MSM//LX22NPO5101
dc.date.embargoStartDate	2024-09-09
dc.date.embargoEndDate	2024-08-10
dc.type.obd	73
dc.type.version	info:eu-repo/semantics/publishedVersion
dc.identifier.doi	10.1007/s11229-024-04702-z
dc.identifier.utWos	001287981600001
dc.identifier.eidScopus	2-s2.0-85200732008
dc.identifier.obd	650392
dc.subject.rivPrimary	50000::50600::50601
dcterms.isPartOf.name	Synthese
dcterms.isPartOf.issn	0039-7857
dcterms.isPartOf.journalYear	2024
dcterms.isPartOf.journalVolume	204
dcterms.isPartOf.journalIssue	2
uk.faculty.primaryId	118
uk.faculty.primaryName	Fakulta sociálních věd	cs
uk.faculty.primaryName	Faculty of Social Sciences	en
uk.department.primaryId	2492
uk.department.primaryName	Katedra bezpečnostních studií	cs
uk.department.primaryName	Department of Security Studies	en
dc.description.pageRange	1-24
dc.type.obdHierarchyCs	ČLÁNEK V ČASOPISU::článek v časopisu::původní článek	cs
dc.type.obdHierarchyEn	JOURNAL ARTICLE::journal article::original article	en
dc.type.obdHierarchyCode	73::152::206	en
uk.displayTitle	Why and how to construct an epistemic justification of machine learning?	en

Soubory tohoto záznamu

Název:: s11229-024-04702-z.pdf
Velikost:: 715.4Kb
Formát:: PDF

Zobrazit/otevřít

Tento záznam se objevuje v následujících kolekcích

Fakulta sociálních věd

Zobrazit minimální záznam