Publications

Reversible Vision Transformers

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Improved Multiscale Vision Transformers for Classification and Detection

From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting

Multiscale Vision Transformers

LOKI: Long Term and Key Intentions for Trajectory Prediction

Overcoming Mode Collapse with Adaptive Multi Adversarial Training

Object-Region Video Transformers

It Is Not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction

Long-term Human Motion Prediction with Scene Context

Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

Learning Spontaneity to Improve Emotion Recognition in Speech

Future Person Localization in First-Person Videos

Do deep neural networks learn shallow learnable examples first?

Bitwise Operations of Cellular Automaton on Gray-scale Images
