xiphmont

Before we get into the update itself, yes, the level of magenta in that banner image got away from me just a bit. Then it was just begging for inappropriate abuse of a font...

Ahem.

Hey everyone! I just posted a Daala update that mostly has to do with still image performance improvements (yes, still image in a video codec. Go read it to find out why!). The update includes metric plots showing our improvement on objective metrics over the past year and relative to other codecs. Since objective metrics are only of limited use, there's also side-by-side interactive image comparisons against jpeg, vp8, vp9, x264 and x265.

The update text (and demo code) was originally for a July update, as still image work was mostly in the beginning of the year. That update get held up and hadn't been released officially, though it had been discovered by and discussed at forums like doom9. I regenerated the metrics and image runs to use latest versions of all the codecs involved (only Daala and x265 improved) for this official better-late-than-never progress report!

Flat | Top-Level Comments Only

From:

xiphmont.livejournal.com

This is not a crazy idea, and it's not even that hard to make it work. Quite a bit or research either does just this, or is at least inspired by the idea.
The hard part is making it work better than preexisting techniques.

From: (Anonymous)

Thanks by response, Monty.

Motivation: Ghost (audio codec) split audio in tone + noise, applying different techniques to each part.
Motivation 2: DCT related transformation does not deal hard edges very well (specially after quantization).

My idea came after read this research:

http://www.cse.cuhk.edu.hk/~leojia/projects/L0smoothing/index.html

The idea whas:

1 - Vectorize the 'l0 smothed' version of image.
2 - Use DCT related (or any other frequency based transformation) to the 'difference' (texture?).

Maybe this idea can be used in future codec, not in (near finished) Daala.

Maurício Kanada

(another anonymous)

Recent deep learning approach looks promising for replacing DCT based model.
It's possible to apply optimized set of (pre-trained) filters for specific type of image/block - noisy, pattern, gradient, landscape, face, ...

Reference
http://the-locster.livejournal.com/110724.html
http://www.cs.nyu.edu/~ranzato/research/projects.html#sparse_coding

A Fabulous Daala Holiday Update

A Fabulous Daala Holiday Update

Re: New way to represent images

Re: New way to represent images

Re: New way to represent images

Profile

xiphmont

A Fabulous Daala Holiday Update

A Fabulous Daala Holiday Update

Re: New way to represent images

Re: New way to represent images

Re: New way to represent images

Profile

Most Popular Tags