TY - GEN
T1 - Robust Reinforcement Learning-based Wald-type Detector for Massive MIMO Radar
AU - Ahmed, Aya Mostafa
AU - Fortunati, Stefano
AU - Sezgin, Aydin
AU - Greco, Maria S.
AU - Gini, Fulvio
N1 - Publisher Copyright:
© 2021 European Signal Processing Conference. All rights reserved.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - The two basic performance indices characterizing the multi-target detection task in a radar system are the probability of false alarm (PFA) and the probability of detection PD. It is well-known that, when the disturbance model (i.e., clutter and noise) is perfectly known, the Neyman-Pearson (NP) detector provides the best decision strategy, i.e., the detector that maximizes the PD, while keeping a constant PFA. However, in practical scenarios, the a priori knowledge of the statistical model of the disturbance is rarely available. In this paper we investigate the robustness of a reinforcement learning (RL) based Wald-type test to guarantee reliable detection performance even without knowledge of the disturbance distribution. Specifically, the constant false alarm Rate (CFAR) property is obtained by applying tools from misspecified asymptotic statistics, while the PD is maximized by exploiting an RL-based scheme.
AB - The two basic performance indices characterizing the multi-target detection task in a radar system are the probability of false alarm (PFA) and the probability of detection PD. It is well-known that, when the disturbance model (i.e., clutter and noise) is perfectly known, the Neyman-Pearson (NP) detector provides the best decision strategy, i.e., the detector that maximizes the PD, while keeping a constant PFA. However, in practical scenarios, the a priori knowledge of the statistical model of the disturbance is rarely available. In this paper we investigate the robustness of a reinforcement learning (RL) based Wald-type test to guarantee reliable detection performance even without knowledge of the disturbance distribution. Specifically, the constant false alarm Rate (CFAR) property is obtained by applying tools from misspecified asymptotic statistics, while the PD is maximized by exploiting an RL-based scheme.
KW - Cognitive radar
KW - Massive MIMO
KW - Reinforcement learning
KW - Robust statistics
KW - Wald test
UR - https://www.scopus.com/pages/publications/85123180504
U2 - 10.23919/EUSIPCO54536.2021.9616093
DO - 10.23919/EUSIPCO54536.2021.9616093
M3 - Conference contribution
AN - SCOPUS:85123180504
T3 - European Signal Processing Conference
SP - 846
EP - 850
BT - 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 29th European Signal Processing Conference, EUSIPCO 2021
Y2 - 23 August 2021 through 27 August 2021
ER -