High resolution image #3

Open
rikusen0335 opened this issue Aug 13, 2017 · 10 comments

Comments

@rikusen0335

Please add the ability to export high-resolution images.

@Misaka-0x447f

Misaka-0x447f commented Aug 14, 2017

With waifu2x? (x
Generating higher-resolution images may require a far more powerful computer (2^n times) to train the model, I think.

@Aixile
Member

Aixile commented Aug 14, 2017

We described our attempts at generating high-resolution images in the paper.

In our situation, the resolution limit mainly comes from the training dataset.
Personally, I don't think waifu2x gives a satisfying result on our generated images, because the low-resolution images are not downscaled ones. This means they lack the details needed to reconstruct high-frequency patterns in high-resolution images.

We also tried to train our own SRGAN-based super-resolution models, but have not found a satisfying model yet. Therefore, we leave the exploration of super-resolution models for future work.
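
To illustrate the mismatch described above, here is a minimal sketch of how paired super-resolution training data is commonly built: the low-resolution input is a downscaled copy of a real high-resolution image. Whether waifu2x or SRGAN builds its pairs exactly this way is an assumption here, and the helper names (`box_downscale`, `make_training_pair`) are hypothetical. A generated 128x128 image was never produced by such a downscale, so an upscaler trained this way may receive inputs unlike anything it saw in training.

```python
# Sketch only: images are nested lists of grayscale values, so no libraries are needed.

def box_downscale(image, factor):
    """Downscale by averaging each factor x factor block (a simple box filter)."""
    height, width = len(image), len(image[0])
    if height % factor or width % factor:
        raise ValueError("image sides must be divisible by factor")
    small = []
    for top in range(0, height, factor):
        row = []
        for left in range(0, width, factor):
            block = [image[y][x]
                     for y in range(top, top + factor)
                     for x in range(left, left + factor)]
            row.append(sum(block) / len(block))
        small.append(row)
    return small

def make_training_pair(high_res, factor=2):
    """Return (low_res_input, high_res_target) for a super-resolution model."""
    return box_downscale(high_res, factor), high_res

# A 4x4 checkerboard: the finest pattern this grid can hold.
checkerboard = [[(x + y) % 2 for x in range(4)] for y in range(4)]
low_res, target = make_training_pair(checkerboard, factor=2)
print(low_res)  # every 2x2 block averages to 0.5, so the pattern is gone
```

The checkerboard collapses to a flat gray image, which shows why high-frequency detail cannot be read back from the low-resolution input alone. The model has to learn it from the paired targets.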

@Skylion007

@Aixile I might be able to help if you need a high res training set.

@YukiSakuma

YukiSakuma commented Aug 15, 2017

edit: anyway, I also agree with a high-res option, but the model would need to be trained on a high-res dataset. After that, I suggest adding an option for the type of expression, or -dere expression (e.g. yandere, tsundere, kuudere), etc.

@ghost

ghost commented Aug 18, 2017

I tried the following commands to produce new high-res images.

cd /tmp
cp index.png o.png
waifu2x-converter-cpp -m scale --scale_ratio 1.15 -i o.png -o o2.png; waifu2x-converter-cpp --scale_ratio 1.3 --noise_level 3 -i o2.png -o o.png;
waifu2x-converter-cpp -m scale --scale_ratio 1.15 -i o.png -o o2.png; waifu2x-converter-cpp --scale_ratio 1.3 --noise_level 3 -i o2.png -o o.png;
waifu2x-converter-cpp -m scale --scale_ratio 1.15 -i o.png -o o2.png; waifu2x-converter-cpp --scale_ratio 1.3 --noise_level 3 -i o2.png -o o.png;

index.png is your target image, which is an anime face.
The result is bad: the image lacks some detail. This must be solved at the source.
It probably means the training data must use high-res images.
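
For reference, the total enlargement produced by the three rounds of commands above can be worked out with a short sketch. The ratios 1.15 and 1.3 and the three repetitions come from the commands. The 128-pixel starting size is an assumption based on the training resolution quoted from the paper in this thread, and the calculation ignores any per-pass rounding done by the tool.

```python
import math

# Scale ratios taken from the commands above: each round runs 1.15x, then 1.3x.
ROUND_RATIOS = (1.15, 1.3)
ROUNDS = 3

def cumulative_scale(ratios, rounds):
    """Total enlargement after repeating the given ratios for `rounds` rounds."""
    return math.prod(ratios) ** rounds

def scaled_side(side, ratios, rounds):
    """Approximate side length in pixels after all passes (ignores per-pass rounding)."""
    return round(side * cumulative_scale(ratios, rounds))

total = cumulative_scale(ROUND_RATIOS, ROUNDS)
print(f"total scale: {total:.2f}x")
# Assumption: the input face is 128 px on a side.
print(f"128 px side becomes about {scaled_side(128, ROUND_RATIOS, ROUNDS)} px")
```

Under these assumptions, the six passes enlarge the image by about 3.34x, so a 128-pixel face ends up at roughly 428 pixels per side.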

@Aixile Yes, you are right.

PS: https://github.com/DeadSix27/waifu2x-converter-cpp

@shaform

shaform commented Sep 30, 2017

@Aixile The paper says:

Finally, from 42000 face images in total from the face detector, we manually
check all anime face images and remove about 4% false positive and undesired images.
We train our GAN model using only images from games released after 2005 and with scaling all
training images to a resolution of 128*128 pixels. This gives 31255 training images in total.

Did you also scale up small images to 128x128 or just use images larger than 128x128?

From figure 14, it appears that more than half of the 42000 faces are smaller than 128x128, so if there were 31255 training images, it suggests that the small images were also used. Did this have any negative effects?

@Aixile
Member

Aixile commented Sep 30, 2017

@shaform I used images larger than 80*80 from the original 42000 faces. Personally, I think 1.5x upsampling is still acceptable.
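
The size filter described in this reply can be sketched as follows. The 80-pixel threshold and the 128-pixel target come from the thread. The function names and the sample sizes are hypothetical, and applying the threshold to the shorter side is an assumption, since the reply does not say how non-square crops were handled.

```python
TARGET_SIDE = 128   # training resolution from the paper excerpt
MIN_SIDE = 80       # threshold mentioned in the reply above

def keep_face(width, height, min_side=MIN_SIDE):
    """Keep a detected face only if its shorter side exceeds the minimum (assumed rule)."""
    return min(width, height) > min_side

def upscale_factor(width, height, target=TARGET_SIDE):
    """Factor needed to bring the shorter side up to the target (1.0 if already large enough)."""
    return max(1.0, target / min(width, height))

# Hypothetical face-crop sizes, for illustration only.
faces = [(64, 64), (81, 81), (96, 100), (128, 128), (200, 180)]
kept = [size for size in faces if keep_face(*size)]
for width, height in kept:
    print(width, height, round(upscale_factor(width, height), 2))
```

With an 80-pixel floor, the largest upscale needed is just under 128/80 = 1.6x, which is roughly in line with the 1.5x figure given in the reply.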

@shaform

shaform commented Sep 30, 2017

@Aixile Thanks! BTW, do you think it's possible to just extract faces from anime episodes? Maybe we could get a high volume of high-resolution, high-quality faces this way.

@Aixile
Member

Aixile commented Sep 30, 2017

@shaform Yes, but one fundamental problem in building a large, high-quality anime face dataset is that lbpcascade_animeface, the only anime face detection library available online, uses old-fashioned methods and has poor precision/recall...
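
For readers unfamiliar with the two metrics mentioned here, a minimal sketch of how precision and recall are computed for a face detector follows. The counts are hypothetical and are not measurements of lbpcascade_animeface.

```python
def precision(true_pos, false_pos):
    """Fraction of detections that are real faces."""
    return true_pos / (true_pos + false_pos)

def recall(true_pos, false_neg):
    """Fraction of real faces that were detected."""
    return true_pos / (true_pos + false_neg)

# Hypothetical counts for illustration; not measurements of any real detector.
true_pos, false_pos, false_neg = 80, 20, 40
print(f"precision = {precision(true_pos, false_pos):.2f}")
print(f"recall    = {recall(true_pos, false_neg):.2f}")
```

Low precision means many non-face crops must be removed by hand, while low recall means many usable faces are never found, which limits the size of the dataset.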

We are still improving our method and the dataset. Currently, we have achieved a much better result on the CelebA dataset compared to the model we published. We hope to release the new model in several months.

@rtanen

rtanen commented Dec 14, 2017

The website can produce 256x256 images now, which might close this issue.
