Aggregation of Local Descriptors Extracted from Image Patches

This diagram illustrates how local descriptors are extracted from regular image patches and then aggregated using transformers.
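
The pipeline in the diagram can be sketched roughly as follows. This is a minimal illustration and not the method shown in the figure: it assumes non-overlapping square patches, a linear projection as the local descriptor, a single self-attention layer standing in for a full transformer (no layer normalization, feed-forward block, or positional encoding), and mean pooling as the final aggregation step. The function names (`extract_patches`, `self_attention`, `aggregate`), the patch size, the descriptor dimension, and the random weights are all hypothetical placeholders for learned components.

```python
import numpy as np

def extract_patches(image, patch):
    """Split an (H, W, C) image into non-overlapping patch x patch tiles,
    each flattened to a vector. Rows/columns that do not fill a tile are dropped."""
    h, w, c = image.shape
    gh, gw = h // patch, w // patch
    image = image[:gh * patch, :gw * patch]
    tiles = image.reshape(gh, patch, gw, patch, c).transpose(0, 2, 1, 3, 4)
    return tiles.reshape(gh * gw, patch * patch * c)

def self_attention(x, wq, wk, wv):
    """Single-head scaled dot-product self-attention over a set of N vectors."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v

def aggregate(image, patch=8, dim=16, seed=0):
    """Return (local descriptors, one aggregated descriptor) for an image.
    Random weights stand in for learned parameters (assumption)."""
    rng = np.random.default_rng(seed)
    patches = extract_patches(image, patch)
    # Assumed local descriptor: a linear projection of the raw patch pixels.
    w_embed = rng.normal(scale=0.02, size=(patches.shape[1], dim))
    descriptors = patches @ w_embed
    wq, wk, wv = (rng.normal(scale=dim ** -0.5, size=(dim, dim)) for _ in range(3))
    # One attention layer with a residual connection lets descriptors exchange context.
    attended = descriptors + self_attention(descriptors, wq, wk, wv)
    # Assumed aggregation: mean pooling over all patch positions.
    return descriptors, attended.mean(axis=0)

image = np.random.default_rng(1).random((32, 32, 3))
local_descriptors, global_descriptor = aggregate(image)
```

With a 32 x 32 x 3 input and 8 x 8 patches, this yields 16 local descriptors of dimension 16 and a single 16-dimensional aggregated vector. In practice the projection and attention weights would be trained, and a class token or another pooling scheme could replace the mean.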