
Table of Links

Supplementary Material

Architecture details

Image matting

8.1. Dataset generation and preparation

8.2. Training details

8.3. Quantitative details

8.4. More qualitative results on natural images

Video matting

9.1. Dataset generation

9.2. Training details

9.3. Quantitative details

9.4. More qualitative results

5. Experiments

We developed our model using PyTorch [20] and the sparse convolution library Spconv [10]. Our codebase is built upon the publicly available implementations of MGM [56] and OTVM [45]. In Sec. 5.1, we first discuss the results when pre-training on the image matting dataset. The performance on the video dataset is shown in Sec. 5.2. All training settings are reported in the supplementary material.

Table 2. Details of Video Instance Matting Training and Testing Sets. V-HIM2K5 is used for training and V-HIM60 for model evaluation. Each video contains 30 frames.

Table 3. Superiority of Mask Embedding Over Stacking in HIM2K+M-HIM2K. Our mask embedding technique demonstrates enhanced performance compared to traditional stacking methods.

Authors:

(1) Chuong Huynh, University of Maryland, College Park (chuonghm@cs.umd.edu);

(2) Seoung Wug Oh, Adobe Research (seoh@adobe.com);

(3) Abhinav Shrivastava, University of Maryland, College Park (abhinav@cs.umd.edu);

(4) Joon-Young Lee, Adobe Research (jolee@adobe.com).

This paper is available on arXiv under the CC BY 4.0 Deed (Attribution 4.0 International) license.

Image and Video Matting Benchmarks: Performance Analysis of MaGGIe
