Detailed Aggregation of Local Descriptors Extracted from Image Patches

This diagram provides a detailed view of the process of extracting local descriptors from regular image patches and their aggregation using transformers.