Mitigating Catastrophic Forgetting in Deep Learning in a Streaming Setting Using Historical Summary

Dash, Sajal; Yin, Junqi; Shankar, Mallikarjun (Arjun); Wang, Feiyi; Feng, Wu-Chun

doi:10.1109/DRBSD754563.2021.00006

Title: Mitigating Catastrophic Forgetting in Deep Learning in a Streaming Setting Using Historical Summary

Conference · Mon Nov 01 00:00:00 EDT 2021

DOI:https://doi.org/10.1109/DRBSD754563.2021.00006· OSTI ID:1885239

Dash, Sajal ^[1];

^[1];

^[1]; Feng, Wu-Chun ^[2]

ORNL
Virginia Tech, Blacksburg, VA

Recent advancements in scientific equipment and the adaptation of electronics and the Internet of Things (IoT) in our everyday lives resulted in large and complex data production at a high rate. Making meaningful and timely knowledge discovery at a modest cost from this big data is difficult for computing power and storage limitations. Training deep learning models incrementally in a streaming setting can help us with overcoming these limitations. However, in a well-known phenomenon named catastrophic forgetting, incrementally trained models increasingly perform poorly on the past data. To mitigate catastrophic forgetting in training in a streaming setting, we propose constructing a historical summary over time and use the summary with newly arrived data during incremental training. We propose various data summarization techniques such as random sampling, micro clustering, coreset computation, and Auto Encoders to counteract catastrophic forgetting. We built a pipeline for incremental training with a historical summary for training deep learning models for streaming data. We demonstrate the effectiveness of historical summary in mitigating catastrophic forgetting using three case studies involving three different deep learning applications: an Artificial Neural Network (ANN) for classification task on MNIST dataset, a language model (RNN-LM) on the WikiText2 dataset, and a Convolutional Neural Network (CNN), ResNet50 to classify the ImageNet dataset. Through the training of the models, we observe that catastrophic forgetting is evident in ANN and CNN but not in an RNN. For the first task, our method recovers up to 47.9% lost accuracy due to catastrophic forgetting. For the third task, the historical summary recovers classification accuracy by up to 25%. For the second task, though there is not proof of catastrophic forgetting, the training performance (PPL) improves by up to 26% with historical summary.

View Conference

Cite

Export

Save

Research Organization:: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

Sponsoring Organization:: USDOE

DOE Contract Number:: AC05-00OR22725

OSTI ID:: 1885239

Resource Relation:: Conference: 2021 7th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-7) - St. Louis, Missouri, United States of America - 11/14/2021 5:00:00 AM-11/14/2021 5:00:00 AM

Country of Publication:: United States

Language:: English

Similar Records

Deep Learning Based Superconducting Radio-Frequency Cavity Fault Classification at Jefferson Laboratory

Journal Article · Mon Jan 03 00:00:00 EST 2022 · Frontiers in Artificial Intelligence · OSTI ID:1885239

Vidyaratne, Lasitha; Carpenter, Adam; Powers, Tom; +4 more

Utilization of Synthetic Near-Infrared Spectra via Generative Adversarial Network to Improve Wood Stiffness Prediction

Journal Article · Fri Mar 01 00:00:00 EST 2024 · Sensors · OSTI ID:1885239

Ali, Syed Danish; Raut, Sameen; Dahlen, Joseph; +4 more

Large Scale Caching and Streaming of Training Data for Online Deep Learning

Conference · Fri Jul 01 00:00:00 EDT 2022 · OSTI ID:1885239

Liu, Jie; Nicolae, Bogdan; Li, Dong; +4 more

Title: Mitigating Catastrophic Forgetting in Deep Learning in a Streaming Setting Using Historical Summary

Citation Formats

Similar Records

Related Subjects