Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization

Source

arxiv.orgfull article ↗

Publisher summary· verbatim

arXiv:2605.17839v3 Announce Type: replace-cross Abstract: Knowledge distillation transfers knowledge from a high capacity teacher to a compact student using a mixture of hard and soft losses. On imbalanced data, a fixed weighting between hard and soft losses becomes brittle the learning process. Rec

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

Discussion

No replies yet. Be first.

Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization

Related coverage

Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization

Related coverage