AI to Enable Accurate Modelling of Data Storage System Performance
Researchers at the HSE Faculty of Computer Science have developed a new approach to modelling data storage systems based on generative machine learning models. This approach makes it possible to accurately predict the key performance characteristics of such systems under various conditions. Results have been published in the IEEE Access journal.
Data storage systems play an important role in today’s digital world, as they are responsible for the safety and prompt availability of vast amounts of information. These systems consist of many components, including controllers, HDD and SSD disks, as well as cache memory, which work together to ensure fast and efficient operation. To achieve optimal performance, it is essential to accurately predict how these systems will function in different scenarios, such as when the load on the system changes.
Researchers at the HSE Faculty of Computer Science developed a new approach to modelling data storage system performance, which relies on generative machine learning models. The authors proposed a method that provides high-precision predictions of the key performance characteristics of the systems: the number of input/output operations per second (IOPS) and latency.
The modelling includes two stages. First, the scientists collect data by measuring the system’s performance under various loads and configurations. This data is then fed to two special generative models: the CatBoost regression model and the normalizing flow model. CatBoost works well with tabular data and can accurately predict average values and performance deviations. The normalizing flow model produces a complete distribution of possible outcomes, taking into account data uncertainties and variability.
Mikhail Hushchyn
‘One of the main advantages of our method is that it does not require detailed knowledge of the internal structure of the system components. This is often impossible due to the manufacturers’ trade secrets. Instead, our generative models are trained directly on real-world data. For instance, in our study, we trained a model using 300,000 measurements. This makes our approach versatile and applicable to any type of data storage system,’ says study author Mikhail Hushchyn, a senior research fellow at the HSE Faculty of Computer Science.
The researchers tested the accuracy of the proposed approach using Little's law, a fundamental principle of queuing theory. According to test results, these predictions are highly consistent with real observations: prediction errors range from just 4–10% for IOPS and 3–16% for latency, while the correlation with the observed values reaches 0.99.
Aziz Temirkhanov
‘Our proposed approach opens up broad prospects for optimising and planning the operation of data centres. It makes it possible to predict the behaviour of the system amid load changes, identify potential performance issues, and optimise power consumption. Furthermore, expensive physical experiments are no longer required for accurate modelling,’ stated Aziz Temirkhanov, a junior research fellow at the Laboratory of Methods for Big Data Analysis.
The experimental code and measurements of the storage system performance are publicly available.
The research was carried out within the Mirror Laboratories project of HSE University on improving the efficiency of data centres and data storage systems using artificial intelligence methods.
See also:
Researchers Present the Rating of Ideal Life Partner Traits
An international research team surveyed over 10,000 respondents across 43 countries to examine how closely the ideal image of a romantic partner aligns with the actual partners people choose, and how this alignment shapes their romantic satisfaction. Based on the survey, the researchers compiled two ratings—qualities of an ideal life partner and the most valued traits in actual partners. The results have been published in the Journal of Personality and Social Psychology.
Trend-Watching: Radical Innovations in Creative Industries and Artistic Practices
The rapid development of technology, the adaptation of business processes to new economic realities, and changing audience demands require professionals in the creative industries to keep up with current trends and be flexible in their approach to projects. Between April and May 2025, the Institute for Creative Industries Development (ICID) at the HSE Faculty of Creative Industries conducted a trend study within the creative sector.
From Neural Networks to Stock Markets: Advancing Computer Science Research at HSE University in Nizhny Novgorod
The International Laboratory of Algorithms and Technologies for Network Analysis (LATNA), established in 2011 at HSE University in Nizhny Novgorod, conducts a wide range of fundamental and applied research, including joint projects with large companies: Sberbank, Yandex, and other leaders of the IT industry. The methods developed by the university's researchers not only enrich science, but also make it possible to improve the work of transport companies and conduct medical and genetic research more successfully. HSE News Service discussed work of the laboratory with its head, Professor Valery Kalyagin.
Children with Autism Process Sounds Differently
For the first time, an international team of researchers—including scientists from the HSE Centre for Language and Brain—combined magnetoencephalography and morphometric analysis in a single experiment to study children with Autism Spectrum Disorder (ASD). The study found that children with autism have more difficulty filtering and processing sounds, particularly in the brain region typically responsible for language comprehension. The study has been published in Cerebral Cortex.
HSE Scientists Discover Method to Convert CO₂ into Fuel Without Expensive Reagents
Researchers at HSE MIEM, in collaboration with Chinese scientists, have developed a catalyst that efficiently converts CO₂ into formic acid. Thanks to carbon coating, it remains stable in acidic environments and functions with minimal potassium, contrary to previous beliefs that high concentrations were necessary. This could lower the cost of CO₂ processing and simplify its industrial application—eg in producing fuel for environmentally friendly transportation. The study has been published in Nature Communications.
HSE Scientists Reveal How Staying at Alma Mater Can Affect Early-Career Researchers
Many early-career scientists continue their academic careers at the same university where they studied, a practice known as academic inbreeding. A researcher at the HSE Institute of Education analysed the impact of academic inbreeding on publication activity in the natural sciences and mathematics. The study found that the impact is ambiguous and depends on various factors, including the university's geographical location, its financial resources, and the state of the regional academic employment market. A paper with the study findings has been published in Research Policy.
Group and Shuffle: Researchers at HSE University and AIRI Accelerate Neural Network Fine-Tuning
Researchers at HSE University and the AIRI Institute have proposed a method for quickly fine-tuning neural networks. Their approach involves processing data in groups and then optimally shuffling these groups to improve their interactions. The method outperforms alternatives in image generation and analysis, as well as in fine-tuning text models, all while requiring less memory and training time. The results have been presented at the NeurIPS 2024 Conference.
When Thoughts Become Movement: How Brain–Computer Interfaces Are Transforming Medicine and Daily Life
At the dawn of the 21st century, humans are increasingly becoming not just observers, but active participants in the technological revolution. Among the breakthroughs with the potential to change the lives of millions, brain–computer interfaces (BCIs)—systems that connect the brain to external devices—hold a special place. These technologies were the focal point of the spring International School ‘A New Generation of Neurointerfaces,’ which took place at HSE University.
New Clustering Method Simplifies Analysis of Large Data Sets
Researchers from HSE University and the Institute of Control Sciences of the Russian Academy of Sciences have proposed a new method of data analysis: tunnel clustering. It allows for the rapid identification of groups of similar objects and requires fewer computational resources than traditional methods. Depending on the data configuration, the algorithm can operate dozens of times faster than its counterparts. Thestudy was published in the journal Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia.
Researchers from HSE University in Perm Teach AI to Analyse Figure Skating
Researchers from HSE University in Perm have developed NeuroSkate, a neural network that identifies the movements of skaters on video and determines the correctness of the elements performed. The algorithm has already demonstrated success with the basic elements, and further development of the model will improve its accuracy in identifying complex jumps.