Abstract: Most current image captioning models rely on the autoregressive approach, which unfortunately results in significant inference delays that hinder their practical use. In contrast, ...
Abstract: Accurate multi-label image classification is essential for real-world applications, especially in scenarios with long-tailed class distributions, where some classes appear frequently while ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results