0 citations0 references

FiLM: Visual Reasoning with a General Conditioning Layer

Proceedings of the AAAI Conference on Artificial Intelligence2018Vol. 32(1)

Citations Over TimeTop 1% of 2018 papers

Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, Aaron Courville

Abstract

We introduce a general-purpose conditioning method for neural networks called FiLM: Feature-wise Linear Modulation. FiLM layers influence neural network computation via a simple, feature-wise affine transformation based on conditioning information. We show that FiLM layers are highly effective for visual reasoning - answering image-related questions which require a multi-step, high-level process - a task which has proven difficult for standard deep learning methods that do not explicitly model reasoning. Specifically, we show on visual reasoning tasks that FiLM layers 1) halve state-of-the-art error for the CLEVR benchmark, 2) modulate features in a coherent manner, 3) are robust to ablations and architectural modifications, and 4) generalize well to challenging, new data from few examples or even zero-shot.

Related Papers

→ A Benchmark Characterization of the EEMBC Benchmark Suite(2009)218 cited
→ Solutions to the Third Benchmark Control Problem(1991)3 cited
Theoretical Analysis of the Benchmark for Choosing Manipulative Instruments of Monetary Policies(2009)
→ Exploring disk performance benchmarks(2017)
→ Support Structure Performance Benchmark(2023)