In this case, video indexing is the process of building a TAPe index by obtaining TAPe attributes from a video file. This is the most resource-intensive task accounting for 99.9% of the process, so we started looking for ways to do it faster. In search of an optimal solution, we compared the video indexing process on different devices and looked at the efficiency of NVIDIA's CPU and GPU, among others.
Through experiments, we show that decoder load tends to 100%, GPU load decreases with the increase in the number of card cores, while the time required to index videos to TAPe format does not change when the number of cores is increased. If we were to load all those thousands of cores at 100%, we would need thousands of encoders/decoders to provide the cores with the volume of information enough to keep them running at full capacity. But such video cards don't exist. All of that is achieved by maximizing the informative value of TAPe attributes, while their number remains minimal and requires few computational resources.
Experiments with other types of video cards have shown that accelerating the process of indexing videos to TAPe format is only possible through the increase in the number of decoders embedded in the motherboard. And still, GPU load remains at small values and drops as the number of cores on the motherboard increases. So the bottleneck of the motherboards is the number of decoders rather than the number of cores. Indexing a file to TAPe format requires very few resources in terms of the number of cores as compared to the resources required for video decoding.
A built-in, efficiently programmed decoder returns so little data (frames, pictures, bmp) that only a few GPU cores are enough to build a TAPe index based on a video. At the same time, most operations related to videos (processing, editing, gaming) even require as few as one decoder. That’s why today’s ‘arms race’ is being waged precisely around the number of cores. However, TAPe, with its ‘innate’ meaningful attributes, needs as many decoders as possible, not cores. That means the architecture of any device tailored to TAPe, with its capabilities, must be fundamentally different.
In tests, we used GeForce GTX 1660 and GeForce RTX 3090 graphics cards.
Experiments were conducted using 3 different hour-long video fragments at different resolutions of 240p, 360p, 480p, 720p, and 1080p in mp4 format.
In the demo video, the experiments are grouped as follows: 10 experiments at 5 different resolutions for the 1st hour-long video fragment and then the same for the 2nd and the 3rd video fragments, with 10 experiments for each of them.