News
We have launched the Co-AI research group web page!
Our papers “Differentiable Task Graph Learning” (NeurIPS 2024) and “Ego-Exo4D” (CVPR 2024) have received the EgoVis Distinguished Paper Award 2024/2025, announced at the EgoVis Workshop at CVPR 2026. Big congrats to Luigi Seminara and all co-authors!
I was recognized as an Outstanding Area Chair at CVPR 2026.
Our paper Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric Videos has been accepted at IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).
I will be serving as an Area Chair for NeurIPS 2026.
Our paper “Integrating Affordances and Attention models for Short-Term Object Interaction Anticipation” has been accepted to the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)!
The preprint is available on arXiv.
I will be serving as an Area Chair for ICML 2026.
I’ll be serving as a Co-Publicity Chair for the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026.
Serving as an Area Chair for ECCV 2026.
I’ll be serving as a Publicity Chair for the 22nd Conference on Advanced Visual and Signal-Based Systems (AVSS) 2026, which will be held in Lecce, Italy!
The Ego-Exo4D IJCV extension is out: https://link.springer.com/article/10.1007/s11263-025-02557-6.
Our paper “How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?” by Giuseppe Lando, Rosario Forte, Giovanni Maria Farinella, and Antonino Furnari, has been awarded the best student paper award at the 23rd International Conference on Image Analysis and Processing (ICIAP 2025).
Three papers accepted for publication at the IEEE Winter Conference on Application of Computer Vision (WACV) 2026:
- Zaira Manigrasso, Matteo Dunnhofer, Antonino Furnari, Moritz Nottebaum, Antonio Finocchiaro, Davide Marana, Rosario Forte, Giovanni Maria Farinella, Christian Micheloni (2026). Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory. In IEEE Winter Conference on Application of Computer Vision (WACV).
- Michele Mazzamuto, Daniele Di Mauro, Gianpiero Francesca, Giovanni Maria Farinella, Antonino Furnari (2026). ProSkill: Segment-Level Skill Assessment in Procedural Videos. In IEEE Winter Conference on Application of Computer Vision (WACV).
- Francesco Ragusa, Michele Mazzamuto, Rosario Forte, Irene D’Ambra, James Fort, Jakob Engel, Antonino Furnari, Giovanni Maria Farinella (2026). Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance. In IEEE Winter Conference on Application of Computer Vision (WACV).
I will be serving as an Area Chair for CVPR 2026.
I’ll be giving two talks at the following ICCV 2025 Workshops:
- October 19: ICCV 2025 Workshop on AI-driven Skilled Activity Understanding, Assessment & Feedback Generation.
- October 20: Workshop on Scene Graphs and Graph Representation Learning.
7 papers accepted at the 23rd International Conference on Image Analysis and Processing (ICIAP 2025)!
Of these, 3 have been accepted for oral presentation, 2 as posters, and 2 in the workshops.
Oral Presentations:
- Catinello, A. S., Farinella, G. M., & Furnari, A. (2025). Mamba-OTR: a Mamba-based Solution for Online Take and Release Detection from Untrimmed Egocentric Video.
- Finocchiaro, A., Farinella, G. M., & Furnari, A. (2025). Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation.
- Lando, G., Forte, R., Farinella, G. M., & Furnari, A. (2025). How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?
Poster Presentations:
- Catinello, A. S., Dunnhofer, M., Farinella, G. M., Frontoni, E., Furnari, A., Micheloni, C., Paolanti, M., Pietrini, R., Salierno, D., Stacchio, L., & Yaar, A. (2025). Ego and exo views for an object-level human behavior analysis and understanding through tracking in retail spaces.
- Manigrasso, Z., Finocchiaro, A., Manara, D., Forte, R., Nottebaum, M., Dunnhofer, M., Farinella, G. M., Furnari, A., & Micheloni, C. (2025). T-EVO: Tracking in Egovision for Online Visual Episodic Memory.
Workshop Papers:
- Yaar, A., Rodin, I., Farinella, G. M., & Furnari, A. (2025). A Benchmark of Egocentric Scene Graph Prediction Methods for Understanding Human-Object Interactions.
- Finocchiaro, A., Catinello, A. S., Mazzamuto, M., Leonardi, R., Furnari, A., & Farinella, G. M. (2025). A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains.
I gave a week-long seminar on Video Understanding at Universidad Carlos III de Madrid to the students of the PhD in Signal Processing and Communications Engineering.

Serving as an Area Chair for WACV 2026.
I’ll give a talk at the AICV4Food workshop @ ICIAP 2025
I joined the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) as an Associate Editor.
Serving as publicity co-chair of ICCV 2025!
I’ll give a talk at the Eyes Of The Future: Integrating Artificial Intelligence in Smart Eyewear (IAISE) workshop at IJCNN 2025 on July 5th in Rome, Italy.
I’ll give a talk at the Egocentric Perception & Action for Robot Learning workshop at RSS 2025 on June 21st in Los Angeles, US.
I’ll give a talk at the Enhancing human mobility: From computer vision-based motion tracking to wearable assistive robot control workshop at ICRA 2025 on May 23rd in Atlanta, US.
Got the italian Habilitation ASN - Abilitazione Scientifica Nazionale) as a Full Professor (Fascia I) in Computer Science (01/B1) and Information Processing Systems (09/H1)
I’ll serve as Program Chair of the VISAPP 2026 Conference
I joined the Pattern Recognition journal as an Associate Editor.
Gave a talk titled “Beyond atomic actions: towards long-form and procedural understanding of egocentric videos” at the Video Understanding Applications workshop @ BMVC 2024. Slides here
Serving as an Area Chair for IJCAI 2025
Serving as an associated editor for ICRA 2025
Our paper on differentiable task graphs has been accepted at NeurIPS 2024 as a spotlight!
Seminara, Farinella, Furnari (2024). Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos. In Advances in Neural Information Processing Systems. [paper] [code]
A paper describing the Ego4D dataset more in details has been published on TPAMI!
Three papers accepted at ECCV 2024!
- Lorenzo Mur-Labadia, Ruben Martinez-Cantin, Josechu Guerrero, Giovanni Maria Farinella, Antonino Furnari. AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation. [Paper]
- Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro, Mario Valerio Giuffrida, Giovanni Maria Farinella. Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs. [Paper]
- Rosario Leonardi, Antonino Furnari, Francesco Ragusa, Giovanni Maria Farinella. Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? [Paper]
We are among winners of two challenges at the EgoVis workshop:
- 🥇 1st place at the EgoVis HoloLens Mistake Detection Challenge with a solution based on gaze analysis detailed here.
- 🥈 2nd place at the EgoVis Ego4D Short Term Anticipation Challenge with a solution based on the paper "AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation" in collaboration with Univ. Zaragoza.
Serving as an Area Chair for WACV 2025
Serving as an Area Chair for NeurIPS 2024 - Benchmarks and Datasets Track
I’ll give a talk at the Precognition workshop at CVPR 2025 on June 12th 2025 in Nashville, US.
Giving a seminar at the University of Zaragoza Research Seminars course
Three papers accepted at CVPR 2024! (1 oral + 2 posters):
- Alessandro Flaborea, Guido Maria D'Amely di Melendugno, Leonardo Plini, Luca Scofano, Edoardo De Matteis, Antonino Furnari, Giovanni Maria Farinella, Fabio Galasso. PREGO: online mistake detection in PRocedural EGOcentric videos[Paper]
- Ivan Rodin, Antonino Furnari, Kyle Min, Subarna Tripathi, Giovanni Maria Farinella. Action Scene Graphs for Long-Form Understanding of Egocentric Videos. [Paper]
- Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. With other 100 authors! Oral < 1% accept rate. [Paper]
I’ll serve as Program Chair of the VISAPP 2025 Conference
The TEAM project has officially started!
The EXTRA-EYE project has officially started!
Giving a tutorial on Egocentric Vision at ICIAP 2023
Serving as an Area Chair for WACV 2024
Aug 2023: Survey paper An Outlook into the Future of Egocentric Vision open for comments on OpenReview until 15 Sep.
PNRR PRIN Project “TEAM” has been accepted and will be funded by the Italian ministry of University and Research
Serving as Academic Assessment at ICVSS 2023
PRIN Project “EXTRA-EYE” has been accepted and will be funded by the Italian ministry of University and Research
Paper “Streaming egocentric action anticipation: an evaluation scheme and approach” accepted for publication in the Computer Vision and Image Understanding Journal
Paper “MECCANO: A multimodal egocentric dataset for humans behavior understanding in the industrial-like domain accepted for publication in the Computer Vision and Image Understanding Journal
I’ll serve as Program Chair of the VISAPP 2024 Conference
I’ll give a tutorial at the Italian Summer School VISMAC 2023
I am an ELLIS member
I’ll serve as an Area Chair for ICCV 2023
The EGO4D paper is accepted for presentation at CVPR 2023
The EGO4D dataset is publicly available!