Show simple item record

dc.contributor.authorCueto-Mendoza, Eduardo
dc.contributor.authorKelleher, John D.
dc.date.accessioned2024-11-01T10:20:28Z
dc.date.available2024-11-01T10:20:28Z
dc.date.issued2024
dc.date.submitted2024en
dc.identifier.citationEduardo Cueto-Mendoza and John D. Kelleher, A framework for measuring the training efficiency of a neural architecture, Artificial Intelligence Review, 57, 349, 2024, 1 - 33en
dc.identifier.otherY
dc.descriptionPUBLISHEDen
dc.description.abstractMeasuring Efficiency in neural network system development is an open research problem. This paper presents an experimental framework to measure the training efficiency of a neural architecture. To demonstrate our approach, we analyze the training efficiency of Convolutional Neural Networks and Bayesian equivalents on the MNIST and CIFAR-10 tasks. Our results show that training efficiency decays as training progresses and varies across different stopping criteria for a given neural model and learning task. We also find a non-linear relationship between training stopping criteria, training Efficiency, model size, and training Efficiency. Furthermore, we illustrate the potential confounding effects of overtraining on measuring the training efficiency of a neural architecture. Regarding relative training efficiency across different architectures, our results indicate that CNNs are more efficient than BCNNs on both datasets. More generally, as a learning task becomes more complex, the relative difference in training efficiency between different architectures becomes more pronounced.en
dc.description.sponsorshipResearch Ireland Centre for Research Training in Digitally-Enhanced Reality (d-real) under Grant No. 18/CRT/6224. ADAPT Research Centre for AI-Driven Digital Content Technology, which is funded by the Research Ireland Research Centres Programme and is co-funded under the European Regional Development Fund (ERDF) through Grant 13/RC/2106 P2en
dc.format.extent1en
dc.format.extent33en
dc.language.isoenen
dc.relation.ispartofseriesArtificial Intelligence Review;
dc.relation.ispartofseries57;
dc.relation.ispartofseries349;
dc.rightsYen
dc.subjectArtificial Intelligenceen
dc.subjectMachine Learningen
dc.subjectDeep Learningen
dc.subjectSustainabilityen
dc.subjectSustainable AIen
dc.subjectResearch Subject Categories::MATHEMATICSen
dc.titleA framework for measuring the training efficiency of a neural architectureen
dc.typeJournal Articleen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/kellehj3
dc.identifier.rssinternalid272479
dc.identifier.doihttps://doi.org/10.1007/s10462-024-10943-8
dc.rights.ecaccessrightsopenAccess
dc.subject.TCDTagARTIFICIAL INTELLIGENCEen
dc.subject.TCDTagArtificial Intelligenceen
dc.subject.TCDTagMACHINE LEARNINGen
dc.identifier.orcid_id0000-0001-6462-3248
dc.status.accessibleNen
dc.contributor.sponsorResearch Ireland Centre for Research Training in Digitally-Enhanced Reality (d-real)en
dc.contributor.sponsorGrantNumber18/CRT/6224.en
dc.contributor.sponsorResearch Ireland ADAPT Centre for AI Driven Digital Content Technologyen
dc.contributor.sponsorGrantNumber13/RC/2106 P2en
dc.identifier.urihttps://hdl.handle.net/2262/110154


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record