"This Person Does Exist" displays the dataset and hidden work behind the famous StyleGAN architecture, that can generate stochastic faces. The dataset with the name Flickr-Faces-HQ (FFHQ) is a collection of high resolution images made available by the NVIDIA Corporation under creative commons license in 2018.
"The dataset consists of 70,000 high-quality PNG images at 1024×1024 resolution and contains considerable variation in terms of age, ethnicity and image background. It also has good coverage of accessories such as eyeglasses, sunglasses, hats, etc. The images were crawled from Flickr, thus inheriting all the biases of that website, and automatically aligned and cropped using dlib. Only images under permissive licenses were collected. Various automatic filters were used to prune the set, and finally Amazon Mechanical Turk was used to remove the occasional statues, paintings, or photos of photos."
This website is here to educate the public about the practice of machine learning as data hungry behemoth that can only exist, because researchers and companies scrape data from the web. In this case all photographers are willingly licensing their pictures for reuse and redistribution, but did they think their models would end up in a face dataset? What about other data that unwillingly ends up hidden in computer models? This work is heavily inspired by Adam Harvey, who is doing research on other facial recognition datasets, which were collected "in the wild".
If you want your picture removed NVIDIA suggests this solution:
To find out whether your photo is included in the Flickr-Faces-HQ dataset, please click this link to search the dataset with your Flickr username.
To get your photo removed from the Flickr-Faces-HQ dataset:
Go to Flickr and do one of the following:
Tag the photo with no_cv to indicate that you do not wish it to be used for computer vision research.
Change the license of the photo to None (All rights reserved) or any Creative Commons license with NoDerivs to indicate that you do not want it to be redistributed.
Make the photo private, i.e., only visible to you and your friends/family.
Get the photo removed from Flickr altogether.
Contact email@example.com. Please include your Flickr username in the email.
We will check the status of all photos from the particular user and update the dataset accordingly.