dvpa: digital video processing & applications course...

DVPA: Digital Video Processing &

Applications

Course Introduction

郑喆坤西安电子科技大学电子工程学院

[email protected]

2015.09.17

Digital Video Processing & Analysis

Instructor: Cheolkon Jung

Office: #217, #100 Building

Email: [email protected]

Webpage: http://web.xidian.edu.cn/zhengzk

http://see.xidian.edu.cn/media

Lecture time:

Monday AM 8:30-10:00 (Every week);

Lecture place: J3-08, Teaching Building

mailto:[email protected]

http://web.xidian.edu.cn/zhengzk

http://web.xidian.edu.cn/zhengzk

http://see.xidian.edu.cn/media

Webpage http://web.xidian.edu.cn/zhengzk

Digital Video Processing & Analysis

Office:

北校区主楼III区309

Email:

[email protected]

Media Lab

Advanced media

Ultra high definition (UHD) video

High dynamic range (HDR) video

Surveillance video

Social media & Big data

Stereoscopic 3D video

Main research topics

Multimedia annotation, retrieval, and security

Image restoration & Super-resolution

Computational photography

Perceptual video coding and H.265

Stereoscopic 3D video processing

Topics include, but are not limited to them

Research In Progress


1. Stereoscopic 3D Media

13

2D to 3D conversion 3DTV

Development of core technologies for stereoscopic 3D media:

1) Conversion of existing 2D into exciting 3D

2) Free viewpoint synthesis

3) High-quality stereoscopic view generation

4) Perceptual stereoscopic video coding

5) Real-time rendering using GPU

6) Visual fatigue prediction and reduction


14

Various displays from small size to large size

: Image/video super-resolution based on machine learning

Different kinds of displays with various size such as TV, PC monitor, mobile phone

Resolution of images and videos is fixed.

No high frequency information while playing small size images in large size display

2. Super-resolution

3. Image Restoration

15

: Image restoration which preserves image details and reduces noise/artifacts

Most imaging devices use CCD/CMOS sensors.

Captured images are compressed for efficient transmission and storing.

Sensor noise and compression artifacts inevitably occur and degrade image quality.

Sensor noiseImaging Devices Compression

Artifact


4. Semantic Concept Detection

16

Manual annotation for huge archive

is labor-intensive and time-consuming

Labeled samples are too small


Exponentially Increasing Personal Contents

How to retrieve our favorite contents?

Semi-supervised

learning based

semantic concept

detection

5. Computational Photography

17


Tone mapping

Mobile imaging devices with comparable computing power (iPhone 5: 1.5GHz CPU)

1) High dynamic range imaging by tone mapping and multi-exposure fusion

2) Specularity removal in color images

3) Interactive image segmentation and background change

4) High-quality motion deblurring

5) Power constrained image contrast enhancement

Background change

Outline

3

24

2 Why digital?

What is DVPA?

Why DVPA Is Required?

25 Applications of DVPA

26 Course overview

21 About Videos

18

Videos

3D Videos

Rich information from visual data.

Moving pictures: videos

Movies in Theaters

Broadcast contents: newscast, sportscast, entertainment and

education contents, etc.

Personal videos obtained by video camera, webcam, mobile cam,

etc.

CCTV surveillance videos

About Videos

Difference Between Images and Videos

Images

Shape

Color

Texture

Videos

Shape

Color

Texture

Motion

Depth in 3D videos

Motion

Depth

Human 3D: perceive depth information from two slightly

different views by two eyes.

Depth Perception

About Videos

Source of videos:

Captured with an analog or digital TV/movie camera

Time-based sequences: radar, satellite, or aerial surveillance

vehicle

Video units:

Why Digital?

Exactness

Perfect reproduction without degradation

Perfect duplication of processing result

Convenient & powerful computer-aided processing

Can perform rather sophisticated processing through hardware

or Software

Even kids can do it!

Easy storage and transmission

1 CD can store 1 video or hundreds of family photos!

Paperless transmission of high quality photos through network

within seconds

Films Sensors

What is DVPA?

DVPA deals with computational frameworks

To extract useful video information

To restore original video information

To compress huge amounts of video data

To represent (structure) raw unstructured videos

To assess video quality (video quality assessment)

DVPA:

What is DVPA?

Digital input: imaging devices (video cam, digital cam, web cam)

Low level processing: encoding (MPEG, JPEG, etc), video

processing & analysis (filtering, enhancement, etc)

Transmission/storage: channel coding (transmission error

minimization)

Remote video processing: decoding, video processing &

analysis

Digital output: displays (TV, mobile devices, etc)

Why DVPA is required?

Facilitate image storage and transmission: Efficiently store an image in a digital camera

Send images to other persons effectively

Image/video compression, motion estimation

Prepare for display or printing: Adjust image size

Video enhancement, video super-resolution, image half-toning

Enhance and restore images: Remove noise and artifacts from transmission or compression

Video denoising, compression artifact reduction

Extract useful information from images: Extract or track objects including faces, texts, and so on

Object segmentation & detection & recognition & tracking, behavior detection & understanding, video segmentation & representation & indexing & annotation & retrieval


Increasing Personal Contents Storage Size Explosion: 1 Terabyte (2007)

Multiple Channels: +300ch (2007)

Personal Contents Explosion

Difficult to Retrieve Want to Preview the ContentsNot Enough Time

to Consume


Popular mobile imaging devices: Smart phone, video camera, personal media player (PMP), etc

Storage size expansion: HDD, CD, DVD, flash memory, etc

Digital videos are increasing exponentially: Social, commercial, industry and military applications

Entertainment, education, medicine, databases, security, etc.

High-speed computing environments in most digital devices

Accordingly, DVPA has been practically applied to several video applications.


Video compression

Video enhancement

Video surveillance

Video content analysis

Video indexing & retrieval

Object detection & recognition

Motion estimation

Research Fields

Stereo video processing

Applications of DVPA

Video compression: Lossless & lossy compression

MPEG, H.264, AVC, M-JPEG, Microsoft VC-9, etc

Video enhancement: Video super-resolution

Video denoising & compression artifact reduction

Video surveillance: Video object detection, recognition, and tracking

Motion estimation and segmentation

Action recognition

Video content analysis: Video segmentation & indexing & browsing

Video summarization & retrieval

Stereo video processing: Depth estimation & 2D-to-3D conversion

Depth-image-based-rendering

Free viewpoint view synthesis

Image Superresolution

Optical Engineering 2013

Image Superresolution

(a) Original image, (b) Nearest neighbor, (c) Bicubic, (d) TOG 2008,

(e) TIP 2010, (f) TPAMI 2010, (g) Pro-I (coupled dictionary learning), (h) Pro-II (KSVD)

GPU-Accelerated Image Super-resolution

JRTIP 2015

De-convolution based image super-resolution on GPU and Multicore Platform

Gradient-consistency-anisotropic-regularization prior (Frequency domain)

Parallel Computing

IEEE ICME 2014; SPIC 2014

42

Face Hallucination

More natural-looking-facial images are required for

high-quality image communications

Resolution of images is fixed.

No high frequency information.

Learning based super-

resolution method

Low-Resolution

High-Resolution

Face Hallucination

High-Quality Image Communications

IEEE SPL 2011

43

Face Hallucination

More natural looking facial images!!

LR Proposed HR

Image Deblocking

<Learned Dictionary>

Block Diagram

Signal Processing: Image Communication 2012

ScienceDirect Top25 Hottest Article

Image Deblocking

Video Deblocking

Interactive Image Segmentation

Interactive Image Segmentation

Original Image Interactive Markers Segmentation Result

Pattern Recognition 2014

Step 1.

Interactive ROI

extraction (part 1).

Step 2.

Initial image ranking

(Part 2).

Step 3.

Relevance feedback

and global

discriminative

learning by ACSP

(Part 4).

Step 4.

Image reranking for

retrieval (Part 2).

Neurocomputing 2015

Interactive Image Retrieval

Interactive Image Retrieval

Neurocomputing 2015

52

Specularity Removal

Original Image Diffuse Reflection Specular Reflection


53

Multi-exposure Fusion

Different exposure images Fusion result

Over

Normal

Under


Sports Video Indexing

<Overview of the proposed method>

IEEE TBC 2009

Sports Video Indexing

<Defined exciting events>

Recognized scores

Depth Estimation

Multimedia Systems 2015

57

2D-to-3D Conversion

Proposed Framework

IEEE TMM 2014

2D-to-3D Conversion

IEEE TMM 2014

2D-to-3D Conversion

60

Depth-Image-Based Rendering

Holes Proposed MethodIEEE ICME 2014

Depth-Image-Based Rendering

Stereo-Plus-Depth Imaging System

Stereo+Depth Imaging System

Xidian & SKKU

Joint Research

Superpixel Matching Based Depth Propagation

IEEE ICIP 2015

Block Matching Proposed Ground truth

Outline

3

24

2 Why digital?

What is DVPA?

Why DVPA Is Required?

25 Applications of DVPA

26 Course overview

21 About Videos

Course Overview

Main topics: Some Basics on Digital Videos

Video Modeling

Motion Estimation

Video Compression

Video Content Analysis

Stereo and 3D Videos

Text book:

Main: 视频信号处理与通信 (Video processing and communications by Wang et al., 清华大学出版社)

Side: Journal or Conference Papers

Course Overview

Methods Lecturer: provide high-quality lecture

• Provide lectures on some basic theories such as main topics of DVPA

• Guide attendees to achieve successful study results

• Evaluate proposal/presentation/demonstration/technical report

Attendees: deal with one of the practical DVPA issues• Select one issue among the recommended candidates by the

lecturer

• Make a proposal and study it after the lecturer confirmation

• Present and demonstrate the study results

• Submit the technical report & source code (C or C++)

All of the lectures will be provided in English

Proposal/presentation/technical report should be provided in English

Course Overview

Schedule (total 32 hours):

Course introduction (2 hours)

Basics of analog and digital video (6 hours)

Motion estimation and analysis (4 hours)

Video filtering and enhancement (6 hours)

Video compression & standards (6 hours)

Applications (8 hours)

Evaluation:

Attendance (10)

Student project (60+Bonus)

Final examination (20)

Homework (10)

Course Overview

Criteria:

Difficulty

Execution (completion)

Presentation (attitude, technical writing)

If there is novelty in the technical report, bonus points will be

given.

dvpa: digital video processing & applications course...

Documents