Image Super-Resolution for Anime-Style Art
fork from : https://github.com/nagadomi/waifu2x.git

nagadomi 42d11a9be0 Merge pull request #48 from mrkn/patch-1		10 years ago
appendix	291a139ed8 improve nginx config	10 years ago
assets	ffcb460dfc fix hit	10 years ago
cache	1273b3609e first commit	10 years ago
data	243e9044cd add slide and appendix	10 years ago
images	95e9d276a3 add new model for photo	10 years ago
lib	306ee3c76d Fix typo	10 years ago
models	95e9d276a3 add new model for photo	10 years ago
.gitignore	2231423056 update training script	10 years ago
LICENSE	f2f5c882eb add LICENSE and NOTICE	10 years ago
NOTICE	f2f5c882eb add LICENSE and NOTICE	10 years ago
README.md	ef9e1c42ce Added descriptions on how to install cud	10 years ago
cleanup_model.lua	a706892b59 merge develop repo	10 years ago
convert_data.lua	a706892b59 merge develop repo	10 years ago
cudnn2cunn.lua	5b4d692f03 add support for RGB color space reconstruction	10 years ago
export_model.lua	a706892b59 merge develop repo	10 years ago
train.lua	5b4d692f03 add support for RGB color space reconstruction	10 years ago
train.sh	5b4d692f03 add support for RGB color space reconstruction	10 years ago
waifu2x.lua	5b4d692f03 add support for RGB color space reconstruction	10 years ago
web.lua	a7a5f5b1b2 fix disk full	10 years ago

waifu2x

Image Super-Resolution for anime-style-art using Deep Convolutional Neural Networks.

Demo-Application can be found at http://waifu2x.udp.jp/ .

Summary

Click to see the slide show.

References

waifu2x is inspired by SRCNN [1]. 2D character picture (HatsuneMiku) is licensed under CC BY-NC by piapro [2].

[1] Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, "Image Super-Resolution Using Deep Convolutional Networks", http://arxiv.org/abs/1501.00092
[2] "For Creators", http://piapro.net/en_for_creators.html

Public AMI

AMI ID: ami-0be01e4f
AMI NAME: waifu2x-server
Instance Type: g2.2xlarge
Region: US West (N.California)
OS: Ubuntu 14.04
User: ubuntu
Created at: 2015-08-12

Third Party Software

Third-Party

Dependencies

Hardware

NVIDIA GPU

Platform

Packages (luarocks)

cutorch
cunn
graphicsmagick
turbo
md5
uuid

Installation

Setting Up the Command Line Tool Environment

(on Ubuntu 14.04)

Install Torch7

sudo apt-get install curl
curl -s https://raw.githubusercontent.com/torch/ezinstall/master/install-all | sudo bash

see Torch (easy) install

Install CUDA

Download Cuda

sudo dpkg -i cuda-repo-ubuntu1404_7.0-28_amd64.deb
sudo apt-get update
sudo apt-get install cuda

Install packages

sudo luarocks install cutorch
sudo luarocks install cunn
sudo apt-get install graphicsmagick libgraphicsmagick-dev
sudo luarocks install graphicsmagick

Test the waifu2x command line tool.

th waifu2x.lua

Setting Up the Web Application Environment (if you needed)

Install luajit 2.0.4

curl -O http://luajit.org/download/LuaJIT-2.0.4.tar.gz
tar -xzvf LuaJIT-2.0.4.tar.gz
cd LuaJIT-2.0.4
make
sudo make install

Install packages

Install luarocks packages.

sudo luarocks install md5
sudo luarocks install uuid
sudo luarocks install turbo

Web Application

Run.

th web.lua

View at: http://localhost:8812/

Command line tools

Noise Reduction

th waifu2x.lua -m noise -noise_level 1 -i input_image.png -o output_image.png

th waifu2x.lua -m noise -noise_level 2 -i input_image.png -o output_image.png

2x Upscaling

th waifu2x.lua -m scale -i input_image.png -o output_image.png

Noise Reduction + 2x Upscaling

th waifu2x.lua -m noise_scale -noise_level 1 -i input_image.png -o output_image.png

th waifu2x.lua -m noise_scale -noise_level 2 -i input_image.png -o output_image.png

Video Encoding

* avconv is ffmpeg on Ubuntu 14.04.

Extracting images and audio from a video. (range: 00:09:00 ~ 00:12:00)

mkdir frames
avconv -i data/raw.avi -ss 00:09:00 -t 00:03:00 -r 24 -f image2 frames/%06d.png
avconv -i data/raw.avi -ss 00:09:00 -t 00:03:00 audio.mp3

Generating a image list.

find ./frames -name "*.png" |sort > data/frame.txt

waifu2x (for example, noise reduction)

mkdir new_frames
th waifu2x.lua -m noise -noise_level 1 -resume 1 -l data/frame.txt -o new_frames/%d.png

Generating a video from waifu2xed images and audio.

avconv -f image2 -r 24 -i new_frames/%d.png -i audio.mp3 -r 24 -vcodec libx264 -crf 16 video.mp4

Training Your Own Model

Data Preparation

Genrating a file list.

find /path/to/image/dir -name "*.png" > data/image_list.txt

(You should use PNG! In my case, waifu2x is trained with 3000 high-resolution-noise-free-PNG images.)

Converting training data.

th convert_data.lua

Training a Noise Reduction(level1) model

mkdir models/my_model
th train.lua -model_dir models/my_model -method noise -noise_level 1 -test images/miku_noisy.png
th cleanup_model.lua -model models/my_model/noise1_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m noise -noise_level 1 -i images/miku_noisy.png -o output.png

You can check the performance of model with models/my_model/noise1_best.png.

Training a Noise Reduction(level2) model

th train.lua -model_dir models/my_model -method noise -noise_level 2 -test images/miku_noisy.png
th cleanup_model.lua -model models/my_model/noise2_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m noise -noise_level 2 -i images/miku_noisy.png -o output.png

You can check the performance of model with models/my_model/noise2_best.png.

Training a 2x UpScaling model

th train.lua -model_dir models/my_model -method scale -scale 2 -test images/miku_small.png
th cleanup_model.lua -model models/my_model/scale2.0x_model.t7 -oformat ascii
# usage
th waifu2x.lua -model_dir models/my_model -m scale -scale 2 -i images/miku_small.png -o output.png

You can check the performance of model with models/my_model/scale2.0x_best.png.

README.md

waifu2x

Summary

References

Public AMI

Third Party Software

Dependencies

Hardware

Platform

Packages (luarocks)

Installation

Setting Up the Command Line Tool Environment

Install Torch7

Install CUDA

Install packages

Setting Up the Web Application Environment (if you needed)

Install luajit 2.0.4

Install packages

Web Application

Command line tools

Noise Reduction

2x Upscaling

Noise Reduction + 2x Upscaling

Video Encoding

Training Your Own Model

Data Preparation

Training a Noise Reduction(level1) model

Training a Noise Reduction(level2) model

Training a 2x UpScaling model