News

The images from 4K-Face dataset is collected from the Internet. Following WIDER FACE event categories, keywords, such as travel, surgeons, and celebration, are used to retrieve more than 25,000 images ...
Update EVA ViT (closest match) to support Perception Encoder models (https://arxiv.org/abs/2504.13181) from Meta, loading Hub weights but I still need to push ...