Towards a Functional Record Theory: Applications to Anomaly Detection

Mines Saint-Étienne

Theme Data analytics & Artificial Intelligence

Record theory

Infinite-dimensional probability

RKHS

Anomaly detection

Functional Data Analysis

Practical information

Thesis supervisor

Anis HOAYEK

Supervisors

Supervisor: Anis HOAYEK, Associate Professor (HDR and expert in record theory)
Co-Advisor 1: Mireille Batton-Hubert, Professor (Expert in data science for anomaly detection)
Co-Advisor 2: Xavier Bay, Associate Professor (Expert in functional data analysis)

Thesis supervisory team

Institut Henri Fayol – Département Génie Mathématique et Industriel (GMI).
LIMOS (UMR CNRS 6158)
IT’M Factory

More information

Description

Record theory provides key probabilistic tools to understand extreme events : phenomena that exceed typical fluctuations and drive critical decisions in science, engineering, and society. It is thus highly relevant for anomaly detection, as anomalies frequently occur at extremes. While classical record theory is well understood in finite-dimensional settings [1,2], the surge of functional and high- dimensional data has created a significant theoretical gap [3,4]. Modern data sources, such as industrial sensor trajectorie, environmental curves, or biomedical signals, are functional objects in infinite-dimensional spaces [4,5,6]. Reducing these data to finite-dimensional vectors often destroys their temporal structures and extreme behaviors, limiting the ability to detect rare functional anomalies, i.e. unusual trajectories or patterns deviating from typical system behavior [4,7]. 

Research on record theory in infinite-dimensional spaces, such as Hilbert and RKHS, remains limited [8]. Classical record models struggle to capture the complex dependencies inherent in high- dimensional data. To overcome this limitation, developing “functional record” approaches inspired by functional data analysis and RKHS-based modeling appears to be a promising direction [8,9]. 

This project aims to establish the mathematical foundations of functional record theory, enabling a deeper probabilistic understanding of extreme patterns in functional data streams. It marks a major departure from the state of the art by extending record theory beyond Euclidean spaces, using tools from functional analysis and RKHS geometry [3,8]. The resulting framework integrates record statistics into functional anomaly detection, a domain where classical record theory falls short [5,10,11,12]. By capturing rare events within the geometry of infinite-dimensional data, it provides interpretable and robust modeling: unlike pointwise approaches, it considers entire trajectories, preserving temporal and geometric structure while leveraging the full informational content. This perspective uncovers rare functional patterns invisible to classical methods [3,7,10,11] and enables the classification of anomalies, from short-term local deviations to persistent or long-term contextual anomalies. Moreover, its solid mathematical formulation simplifies existing models and reduces computational complexity compared to data-driven alternatives, supporting online, unsupervised detection without training. Hence, the project opens a new research direction at the intersection of record theory, extreme value theory, and functional data analysis.

 Applications are broad and impactful. In the industrial sector, functional record theory detects rare operational profiles in sensor networks and cyber-physical systems, and supports intelligent energy monitoring and continuous quality control in complex manufacturing [10,11,12]. In climate science, it offers a novel description of record-breaking temperature or pollution curves [4]. Last but not least, in healthcare, it can identify unusual physiological patterns (ECG curves and brain signals) that pointwise methods overlook [5,12]. These use cases strongly align with IMT’s strategic goals in responsible industry, energy transition, and health engineering.

Bibliography

1. Hoayek, A. S., Ducharme, G. R., & Khraibani, Z. (2017). Distribution-free inference in record series. Extremes, 20(3), 585-603.
2. Hoayek, A. (2016).. Estimation des paramètres pour des modèles adaptés aux séries de records (Doctoral dissertation, Université Montpellier).
3. Hsing, T., & Eubank, R. (2015). Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators. Wiley. [DOI:10.1002/9781118762547]
4. French, J., Kokoszka, P., Stoev, S., & Hall, L. (2019). Quantifying the risk of heat waves using extreme value theory and spatio-temporal functional data. Computational Statistics & Data Analysis, 135, 107–122
5. Li, B., & Solea, E. (2018). A nonparametric graphical model for functional data with application to brain networks based on fMRI. Journal of the American Statistical Association, 113(522), 1373–1386.
6. Wittenberg, P., Neumann, L., & Mendler, A. (2025). Covariate-adjusted functional data analysis for structural health monitoring. Data-Centric Engineering. Cambridge University Press.
7. Müller, H.G. (2016). Peter Hall, functional data analysis and random objects. Annals of Statistics, 44(5), 2017–2021
8. Liu, M. (2024). Statistical Learning and Inference for Functional Predictor Models via Reproducing Kernel Hilbert Space. University of Alberta.
9. Wang, J., Wong, R.K.W., & Zhang, X. (2022). Low-rank covariance function estimation for multidimensional functional data. Journal of the American Statistical Association, 117(540), 1921–1937
10. Kamel, M., Hoayek, A., & Batton-Hubert, M. (2025, February). Coupling Variable Selection and Anomaly Detection: Record-Based Approach. In 2025 5th IEEE Middle East and North Africa Communications Conference (MENACOMM) (pp. 1-6). IEEE.
11. Kheirallah, R., Hoayek, A., Batton-Hubert, M., & Burlat, P. (2025). Anomaly Detection in a Production Line: Statistical Learning Approach and Industrial Application. Procedia CIRP, 134, 1017-1022.
12. Li, B., & Song, J. (2017). Nonlinear sufficient dimension reduction for functional data. Annals of Statistics, 45(3), 1757–1787.