AI to Enable Accurate Modelling of Data Storage System Performance

Researchers at the HSE Faculty of Computer Science have developed a new approach to modelling data storage systems based on generative machine learning models. This approach makes it possible to accurately predict the key performance characteristics of such systems under various conditions. Results have been published in the IEEE Access journal.
Data storage systems play an important role in today’s digital world, as they are responsible for the safety and prompt availability of vast amounts of information. These systems consist of many components, including controllers, HDD and SSD disks, as well as cache memory, which work together to ensure fast and efficient operation. To achieve optimal performance, it is essential to accurately predict how these systems will function in different scenarios, such as when the load on the system changes.
Researchers at the HSE Faculty of Computer Science developed a new approach to modelling data storage system performance, which relies on generative machine learning models. The authors proposed a method that provides high-precision predictions of the key performance characteristics of the systems: the number of input/output operations per second (IOPS) and latency.
The modelling includes two stages. First, the scientists collect data by measuring the system’s performance under various loads and configurations. This data is then fed to two special generative models: the CatBoost regression model and the normalizing flow model. CatBoost works well with tabular data and can accurately predict average values and performance deviations. The normalizing flow model produces a complete distribution of possible outcomes, taking into account data uncertainties and variability.
Mikhail Hushchyn
‘One of the main advantages of our method is that it does not require detailed knowledge of the internal structure of the system components. This is often impossible due to the manufacturers’ trade secrets. Instead, our generative models are trained directly on real-world data. For instance, in our study, we trained a model using 300,000 measurements. This makes our approach versatile and applicable to any type of data storage system,’ says study author Mikhail Hushchyn, a senior research fellow at the HSE Faculty of Computer Science.
The researchers tested the accuracy of the proposed approach using Little's law, a fundamental principle of queuing theory. According to test results, these predictions are highly consistent with real observations: prediction errors range from just 4–10% for IOPS and 3–16% for latency, while the correlation with the observed values reaches 0.99.
Aziz Temirkhanov
‘Our proposed approach opens up broad prospects for optimising and planning the operation of data centres. It makes it possible to predict the behaviour of the system amid load changes, identify potential performance issues, and optimise power consumption. Furthermore, expensive physical experiments are no longer required for accurate modelling,’ stated Aziz Temirkhanov, a junior research fellow at the Laboratory of Methods for Big Data Analysis.
The experimental code and measurements of the storage system performance are publicly available.
The research was carried out within the Mirror Laboratories project of HSE University on improving the efficiency of data centres and data storage systems using artificial intelligence methods.
See also:
HSE Scientists Use MEG for Precise Language Mapping in the Brain
Scientists at the HSE Centre for Language and Brain have demonstrated a more accurate way to identify the boundaries of language regions in the brain. They used magnetoencephalography (MEG) together with a sentence-completion task, which activates language areas and reveals their functioning in real time. This approach can help clinicians plan surgeries more effectively and improve diagnostic accuracy in cases where fMRI is not the optimal method. The study has been published in the European Journal of Neuroscience.
For the First Time, Linguists Describe the History of Russian Sign Language Interpreter Training
A team of researchers from Russia and the United Kingdom has, for the first time, provided a detailed account of the emergence and evolution of the Russian Sign Language (RSL) interpreter training system. This large-scale study spans from the 19th century to the present day, revealing both the achievements and challenges faced by the professional community. Results have been published in The Routledge Handbook of Sign Language Translation and Interpreting.
HSE Scientists Develop DeepGQ: AI-based 'Google Maps' for G-Quadruplexes
Researchers at the HSE AI Research Centre have developed an AI model that opens up new possibilities for the diagnosis and treatment of serious diseases, including brain cancer and neurodegenerative disorders. Using artificial intelligence, the team studied G-quadruplexes—structures that play a crucial role in cellular function and in the development of organs and tissues. The findings have been published in Scientific Reports.
New Catalyst Maintains Effectiveness for 12 Hours
An international team including researchers from HSE MIEM has developed a catalyst that enables fast and low-cost hydrogen production from water. To achieve this, the scientists synthesised nanoparticles of a complex oxide containing six metals and anchored them onto various substrates. The catalyst supported on reduced graphene layers proved to be nearly three times more efficient than the same oxide without a substrate. This development could significantly reduce the cost of hydrogen production and accelerate the transition to green energy. The study has been published in ACS Applied Energy Materials. The work was carried out under a grant from the Russian Science Foundation.
HSE Strategic Technological Projects in 2025
In 2025, HSE University continued its participation in the Priority 2030 Strategic Academic Leadership Programme, maintaining a strong focus on technological leadership in line with the programme’s updated framework. A key element of the university’s technological leadership strategy is its Strategic Technological Projects (STPs), aimed at creating in-demand, knowledge-intensive products and services.
School Students Master Communication with GigaChat at HSE and Sber Hackathon
In late December 2025, a unique competition was held at HSE University where participants solved challenges not by writing code, but solely by interacting with Sber’s GigaChat artificial intelligence model. The Improm(p)tu hackathon was an experiment less about programming skills than a new form of literacy: the ability to work effectively with AI by translating complex problems into a language neural networks can understand.
HSE Researchers Offer Guidance to Prevent Undergraduate Burnout
Researchers at the HSE Institute of Education have identified how much time students should ideally devote to their studies, extracurricular activities, and personal life to maintain strong academic performance without compromising their mental health. An analysis of responses from 2,753 students, combined with their actual academic results, revealed several risk factors—such as excessive homework—as well as positive factors, including sufficient sleep, regular exercise, and moderate participation in projects. Based on these findings, the researchers developed practical recommendations for both students and universities. The paper has been published in the European Journal of Education.
Scientists Discover Why Parents May Favour One Child Over Another
An international team that included Prof. Marina Butovskaya from HSE University studied how willing parents are to care for a child depending on the child’s resemblance to them. The researchers found that similarity to the mother or father affects the level of care provided by parents and grandparents differently. Moreover, this relationship varies across Russia, Brazil, and the United States, reflecting deep cultural differences in family structures in these countries. The study's findings have been published in Social Evolution & History.
When a Virus Steps on a Mine: Ancient Mechanism of Infected Cell Self-Destruction Discovered
When a virus enters a cell, it disrupts the cell’s normal functions. It was previously believed that the cell's protective response to the virus triggered cellular self-destruction. However, a study involving bioinformatics researchers at HSE University has revealed a different mechanism: the cell does not react to the virus itself but to its own transcripts, which become abnormally long. The study has been published in Nature.
Researchers Identify Link between Bilingualism and Cognitive Efficiency
An international team of researchers, including scholars from HSE University, has discovered that knowledge of a foreign language can improve memory performance and increase automaticity when solving complex tasks. The higher a person’s language proficiency, the stronger the effect. The results have been published in the journal Brain and Cognition.


