This article examines the growing threat of web scraping (parsing) as a form of automated cyberattack, particularly aimed. Although scraping publicly available data is often legal, its misuse can lead to serious consequences, including server overload, data breaches and intellectual property infringement. Recent court cases against OpenAI and ChatGPT highlight the legal uncertainty associated with unauthorized data collection. The study presents a dual approach to combat malicious scraping. Traffic Classification Model - a machine learning based solution using Random Forest algorithms results in performance that achieves 89% accuracy in distinguishing between legitimate and malicious bot traffic, enabling early detection of scraping attempts. Data Deception Technique - the countermeasure dynamically modifies HTML content to convey false information to scrapers while maintaining the original look of the page. This technique prevents data collection without affecting the user experience. Performance results include real-time traffic monitoring, dynamic page obfuscation, and automatic response systems. The proposed system demonstrates effectiveness in mitigating the risks associated with scraping and emphasizes the need for adaptive cybersecurity measures in evolving digital technologies.
Keywords: parsing, automated attacks, data protection, bot detection, traffic classification, machine learning, attack analysis, data spoofing, web security
In this paper, methods for estimating one's own position from a video image are considered. A robust two-stage algorithm for reconstructing the scene structure from its observed video images is proposed. In the proposed algorithm, at the feature extraction and matching stage, a random sample based on the neighborhood graph cuts is used to select the most probable matching feature pairs. At the nonlinear optimization stage, an improved optimization algorithm with an adaptive attenuation coefficient and dynamic adjustment of the trust region is used. Compared with the classical Levenberg-Marquard (LM) algorithm, global and local convergence can be better balanced. To simplify the system's decisions, the Schur complement method is used at the group tuning stage, which allows for a significant reduction in the amount of computation. The experiments confirmed the operability and effectiveness of the proposed algorithm.
Keywords: 3D reconstruction,graph-cut, Structure-from-Motion (SfM),RANSAC,Bundle Adjustment optimization,Levenberg-Marquardt algorithm,Robust feature matching
The paper considers a lightweight modified version of the YOLO-v5 neural network, which is used to recognize road scene objects in the task of controlling an unmanned vehicle. In the proposed model, the pooling layer is replaced by the ADown module in order to reduce the complexity of the model. The C2f module is added as a feature extraction module to improve accuracy by combining features. Experiments using snowy road scenes are presented and the effectiveness of the proposed model for object recognition is demonstrated.
Keywords: road scene object recognition, YOLOv5, Adown, C2f, deep learning, pooling layer, neural network, lightweight network, dataset
The characteristics of a submersible induction motor are described with sufficient reliability for practice by the theory of multi-motor electric drive. In this case, the classical circuit of a submersible induction motor is a coupled system of several equivalent-T circuits. In turn, this significantly increases its computational complexity and reduces the speed of ACS. It is proposed to construct a mathematical model of the submersible electric motor in the form of polynomials with significantly higher speed using the methods of experiment planning. In the area of applicability, the differences in the estimation of energy performance do not exceed 1.1%, between the proposed models and classical equivalent-T circuits.
Keywords: automated control system, mathematical model, polynomial, mean absolute percentage error, computational complexity, design of experiment, scatter diagram, modal interval, submersible electrical motor, rotor package
The article presents a mathematical model that formalizes the process of managing the scientific activities of an organization. The model based on the theory of queuing. The principle of death - reproduction used in the construction. For a special case, a graph of states and a system of Kolmogorov differential equations are given. The intensity of the input and output streams are time-dependent non-stationary streams. The model allows us to consider various structures and schemes of interaction between scientific departments and various sce-narios for setting scientific tasks and the intensity of their solution by employees of the organization. A software package for decision-making has developed for the model for optimal management of the scientific activities of the department. The article presents one of the results of an experimental and model study of the influence of the motivational component and the level of competence of employees. Graphs of the system states given for the resulting solution. The research can used for comprehensive evaluation of results, planning, resource allocation and management of scientific activities.
Keywords: diversification of management, production diversification, financial and economic purposes of a diversification, technological purposes of ensuring flexibility of production
The article presents the results of a study on the effectiveness of the hashing algorithms Argon2, Scrypt, and Bcrypt in the context of developing web applications with user registration and authentication features. The main focus of this research is on analyzing the algorithms' resilience to brute-force attacks, hardware attacks (GPU/ASIC), as well as evaluating their computational performance. The results of the experiments demonstrate the advantages of Scrypt in terms of balancing execution time and security. Recommendations for selecting algorithms based on security and performance requirements are also provided.
Keywords: hashing algorithm, user registration interface, user authentication interface, privacy protection
The article discusses the structure and principle of operation of an improved centrifugal unit for mixing bulk materials. A special feature of which is the ability to control mixing modes. Due to its design, the selection of a rational position of the bump makes it possible to provide such conditions for the impact interaction of particle flows, in which a high-quality homogeneous mixture of components is formed, the particles of which have different sizes, shapes and other parameters. To characterize the resulting mixture, the coefficient of heterogeneity was used, the conclusion of which is based on a probabilistic approach. A computational scheme of the rarefied flow formation process is given. An expression is derived for calculating the coefficient of heterogeneity when mixing bulk media, the particles of which have different sizes, shapes and other parameters. The research conducted in the article allows not only to predict the quality of the resulting mixture, but also to identify the factors that have the greatest impact on achieving the required uniformity.
Keywords: aggregate, bulk media, mixing, coefficient of heterogeneity, concentration, design scheme, particle size
The problem of substantiating the grid step of an electronic map used to establish the fact that a vehicle, whose coordinates are read from an on-board GPS sensor, enters the control area is considered. The proposed decision support system determines the maximum allowable step for marking a geographical area for subsequent information processing, in which the amount of resulting data is close to the minimum, taking into account the preservation of processing time and acceptable the level of error of the data for analytics. To solve the problem, it is proposed to use the regression analysis apparatus to determine the dependence of the expected verdict error on the grid step of the electronic map. The analysis of residual variances was used to prove the applicability of the proposed device. The Laplace formula was used to estimate the confidence interval of erroneous verdicts.
Keywords: grid step, confidence probability, percentage of erroneous verdicts, regression analysis, coefficient of determination
The article discusses a software module developed by the authors for automatic generation of program code based on UML diagrams. The relevance of developing this module is due to the limitations of existing foreign code generation tools related to functionality, ease of use, support for modern technologies, as well as their unavailability in Russian Federation. The module analyzes JSON files obtained by exporting UML diagrams from the draw.io online service and converts them into code in a selected programming language (Python, C++, Java) or DDL scripts for DBMS (PostgreSQL, Oracle, MySQL). The Python language and the Jinja2 template engine were used as the main development tools. The operation of the software module is demonstrated using the example of a small project "Library Management System". During the study, a series of tests were conducted on automatic code generation based on the architectures of software information systems developed by students of the Software Engineering bachelor's degree program in the discipline "Design and Architecture of Software Systems". The test results showed that the code generated using the developed module fully complies with the original UML diagrams, including the structure of classes, relationships between them, as well as the configuration of the database and infrastructure (Docker Compose). The practical significance of the investigation is that the proposed concept of generating program code based on visual models of UML diagrams built in the popular online editor draw.io significantly simplifies the development of software information systems, and can be used for educational purposes.
Keywords: code generation, automation, python, jinja2, uml diagram, json, template engine, parsing, class diagram, database, deployment diagram
Differential-algebraic equations for describing the motion of a plane-parallel robot-manipulator are investigated. The dynamic model is constructed using the Lagrange equation and the substructure method. The design of a control system regulator using the sliding mode method is considered. The control accuracy is tested on a model of a 3-RRR plane-parallel robot . It consists of three kinematic chains, each of which has two links with three rotational joints. To study the efficiency of the controller, a circular trajectory is used as the target motion for the multibody system. The considered control system for a plane-parallel robot is capable of solving problems of movement and ensuring high positioning accuracy.
Keywords: control, plane-parallel robot, kinematic characteristics, dynamic model, differential-algebraic equations, constraint equation, controller, sliding mode, Lyapunov function, program trajectory
This article presents a structured approach to deploying and integrating Grafana, Loki, and Alloy in Kubernetes environments. The work was performed using a cluster managed via Kubespray. The architecture is focused on ensuring external availability, high fault tolerance, and universality of use.
Keywords: monitoring, ocestration, containerization, Grafana, Loki, Kubernetes, Alloy
Changes in external conditions, parameters of object functioning, relationships between system elements and system connections with the supersystem lead to a decrease in the accuracy of the artificial intelligence models results, which is called model degradation. Reducing the risk of model degradation is relevant for electric power engineering tasks, the peculiarity of which is multifactor dependencies in complex technical systems and the influence of meteorological parameters. Therefore, automatic updating of models over time is a necessary condition for building user confidence in forecasting systems in power engineering tasks and industry implementations of such systems. There are various methods used to prevent degradation, including an algorithm for detecting data drift, an algorithm for updating models, their retraining, additional training, and fine-tuning. This article presents the results of a study of drift types, their systematization and classification by various features. The solution options that developers need to make when creating intelligent forecasting systems to determine a strategy for updating forecast models are formalized, including update trigger criteria, model selection, hyperparameter optimization, and the choice of an update method and data set formation. An algorithm for forming a strategy for automatic updating of artificial intelligence models is proposed and practical recommendations are given for developers of models in problems of forecasting time series in the power industry, such as forecasting electricity consumption, forecasting the output of solar, wind and hydroelectric power plants.
Keywords: time series forecasting, artificial intelligence, machine learning, trusted AI system, model degradation, data drift, concept drift
This study presents an effective vision -based method to accurately identify predator species from camera trap images in protected Uganda areas. To address the challenges of object detection in natural environments, we propose a new multiphase deep learning architecture that combines extraction of various features with concentrated edge detection. Compared to previous approaches, our method offers 90.9% classification accuracy, significantly requiring fewer manual advertising training samples. Background pixels were systematically filtered to improve model performance under various environmental conditions. This work advances in both biology and computational vision, demonstrating an effective and data-oriented approach to automated wildlife monitoring that supports science -based conservation measures.
Keywords: deep learning, camera trap, convolutional neural network, dataset, predator, kidepo national park, wildlife
This paper provides a survey of metrics used to assess the quality of images generated by generative models. Specialized metrics are required to objectively evaluate image quality. A comparative analysis showed that a combination of different metrics is necessary for a comprehensive evaluation of generation quality. Perceptual metrics are effective for assessing image quality from the perspective of machine systems, while metrics evaluating structure and details are useful for analyzing human perception. Text-based metrics allow for the assessment of image-text alignment but cannot replace metrics focused on visual or structural evaluation. The results of this study will be beneficial for specialists in machine learning and computer vision, as well as contribute to the improvement of generative algorithms and the expansion of diffusion model applications.
Keywords: deep learning, metric, generative model, image quality, image
The article provides a justification for the concept of a folding system for a prefabricated residential module based on wooden structures. An analysis of foreign analogues of prefabricated transformable wooden buildings and an assessment of the possibility of their use in northern climatic conditions has been performed. A transformation system for a prefabricated wooden module for use in northern and Arctic conditions is proposed and substantiated.
Keywords: low-rise housing construction, transformation, transformation of low-rise residential buildings, prefabricated transformable buildings, pre-manufactured at the factory, high degree of factory readiness