Since 18 of December 2019 conferences.iaea.org uses Nucleus credentials. Visit our help pages for information on how to Register and Sign-in using Nucleus.

November 29, 2021 to December 6, 2021
Virtual event
Europe/Vienna timezone
30 Nov - 3 Dec, 2021 Abstract submission open NOW

Machine learning models for real-time inference of plasma dynamics using the 2-dimensional beam emission spectroscopy system at DIII-D

Dec 1, 2021, 4:20 PM
10m
Virtual event

Virtual event

Short Talks Deep Learning Wednesday 1 Dec

Speaker

Lakshya Malhotra (University of Wisconsin-Madison)

Description

Multi-channel fluctuation diagnostics capture the spatial patterns of high-bandwidth plasma dynamics. The paper reports on an effort to develop machine learning (ML) models for the real-time identification of edge-localized-mode (ELM) events and the turbulence properties of confinement regimes using the 2-dimensional Beam Emission Spectroscopy (BES) system at DIII-D. The 64-channel BES system captures plasma density perturbations at a 1 MHz frame rate. The “edge ML” models will be deployed on high-throughput hardware such as a Field Programmable Gate Array (FPGA) accelerator for integration in the real-time plasma control system (PCS). The models will generate reduced signals that correspond to ELM activity and turbulence dynamics, and the real-time PCS will learn to avoid ELM regimes and to achieve and sustain advanced confinement regimes such as the wide pedestal QH-mode.
The ELM onset dataset contains about 400 hand-labelled ELM events each consisting of about 4-8 millisecond time series data from all the 64 BES channels. Before model training, about 15% of the total ELM events are kept for cross-validation and 10% of them are reserved as previously unseen (test) data. The remaining data is used for training various machine learning models ranging from logistic regression to deep convolutional neural networks. The entire training and cross-validation workflow makes use of time series data truncated beyond 75 µs after the leading edge of the active ELM in each ELM event. Instead of sequential training on the time series, the input data windows, and their corresponding labels, are sampled at random from the ELM events. This whole data pre-processing and training pipeline is repeated for different values of label lookaheads from 10-100 µs to assess the predictive power of the trained models in context of predicting ELM onset and turbulence activity. The model performance on the test data is analyzed using metrics like precision, recall, F1-score and area under the receiver operating characteristics curve (ROC curve) showing the variation of True Positive Rate (TPR) with False Positive Rate (FPR). The model robustness is also assessed using custom metrics such as macro predictions in which the entire time series is divided into the regions of pre-active ELM (region on the left of the first vertical line in the left figure), and active ELM (region between two vertical lines) and model predictions are analyzed in each of the regions separately which are then combined. Various feature engineering techniques based on the max and average pooling or the output feature map from a convolutional neural network are also employed and tested against classical machine learning algorithms such as logistic regression, random forests and gradient boosting. As compared to the baseline results of around 0.83 using classical machine learning algorithms, deep convolutional neural networks yield an AUC-ROC score of about 0.92 on the test data (right figure). Preliminary results also indicate that deep convolutional models can predict ELM onset up to 100 µs in advance using data windows as small as 16 µs. The predictive performance suggests that ELM precursors can be identified and detected in high-bandwidth data streams for ELM onset warning. The paper also explores other data pre-processing techniques like spatial and time gradients as color channels and different neural network architectures such as autoencoders to perform multiple classification and forecasting tasks like ELM prediction, LH transition forecasting, and control for enhanced confinement regimes such as the wide pedestal QH-mode. Autoencoders can help to compress the spatial-temporal information from the input data windows in a low-dimension feature space which can be used for self-supervised feature engineering with unlabelled datasets.
Supported by US DOE Grant No. DE-SC0021157, DE-FG02-08ER54999, and DE-FC02-04ER54698.

Country or International Organisation United States of America
Affiliation University of Wisconsin-Madison

Primary author

Lakshya Malhotra (University of Wisconsin-Madison)

Co-authors

David Smith (University of Wisconsin-Madison) Mr Prannav Arora (University of Wisconsin-Madison) George McKee (University of Wisconsin-Madison) Zheng Yan (University of Wisconsin-Madison) Mr Jeffrey Zimmerman (University of Wisconsin-Madison) Mark Boyer (Princeton Plasma Physics Laboratory) Mr Ryan Coffee (SLAC National Accelerator Lab) Mr Azarakhsh Jalalvand (Ghent University) Egemen Kolemen (Princeton University)

Presentation materials