Filippo Casu at GTI

 

Research  

 

GTI Data   

 

Open databases created and software developed by the GTI and supplemental material to papers.  

 

Databases  


SportCLIP (2025): Multi-sport dataset for text-guided video summarization.
Ficosa (2024):
The FNTVD dataset has been generated using the Ficosa's recording car.
MATDAT (2023):  More than 90K labeled images of martial arts tricking.
SEAW – DATASET (2022): 3 stereoscopic contents in 4K resolution at 30 fps.
UPM-GTI-Face dataset (2022): 11 different subjects captured in 4K, under 2 scenarios, and 2 face mask conditions.
LaSoDa (2022): 60 annotated images from soccer matches in five stadiums with different characteristics and light conditions.
PIROPO Database (2021):People in Indoor ROoms with Perspective and Omnidirectional cameras.
EVENT-CLASS (2021): High-quality 360-degree videos in the context of tele-education.
Parking Lot Occupancy Database (2020)
Nighttime Vehicle Detection database (NVD) (2019)
Hand gesture dataset (2019): Multi-modal Leap Motion dataset for Hand Gesture Recognition.
ViCoCoS-3D (2016): VideoConference Common Scenes in 3D.
LASIESTA database (2016): More than 20 sequences to test moving object detection and tracking algorithms.
Hand gesture database (2015): Hand-gesture database composed by high-resolution color images acquired with the Senz3D sensor.
HRRFaceD database (2014):Face database composed by high resolution images acquired with Microsoft Kinect 2 (second generation).
Lab database (2012): Set of 6 sequences to test moving object detection strategies.
Vehicle image database (2012)More than 7000 images of vehicles and roads.           

 

Software  


Empowering Computer Vision in Higher Education(2024)A Novel Tool for Enhancing Video Coding Comprehension.
Engaging students in audiovisual coding through interactive MATLAB GUIs (2024)

TOP-Former: A Multi-Agent Transformer Approach for the Team Orienteering Problem (2023)

Solving Routing Problems for Multiple Cooperative Unmanned Aerial Vehicles using Transformer Networks (2023)
Vision Transformers and Traditional Convolutional Neural Networks for Face Recognition Tasks (2023)
Faster GSAC-DNN (2023): A Deep Learning Approach to Nighttime Vehicle Detection Using a Fast Grid of Spatial Aware Classifiers.
SETForSeQ (2020): Subjective Evaluation Tool for Foreground Segmentation Quality. 
SMV Player for Oculus Rift (2016)

Bag-D3P (2016): 
Face recognition using depth information. 
TSLAB (2015): 
Tool for Semiautomatic LABeling.   
 

   

Supplementary material  


Soccer line mark segmentation and classification with stochastic watershed transform (2022)
A fully automatic method for segmentation of soccer playing fields (2022)
Grass band detection in soccer images for improved image registration (2022)
Evaluating the Influence of the HMD, Usability, and Fatigue in 360VR Video Quality Assessments (2020)
Automatic soccer field of play registration (2020)   
Augmented reality tool for the situational awareness improvement of UAV operators (2017)
Detection of static moving objects using multiple nonparametric background-foreground models on a Finite State Machine (2015)
Real-time nonparametric background subtraction with tracking-based foreground update (2015)  
Camera localization using trajectories and maps (2014)

 

                                                                                                                                                                                                                             
 
                                                                   
 
                                                                                                                                                             
 
      

 

 

Detection of Alzheimer's Dementia using Large Language Models applied to spontaneous speech

On Monday, October 28, Aula B-221 of the ETSIT UPM (Universidad Politécnica de Madrid) hosted a fascinating talk by Filippo Casu, a former member and friend of the Grupo de Tratamiento de Imágenes (GTI), who is currently doing research at the Università degli Studi di Sassari, Italy.

The presentation, entitled “Detection of Alzheimer's Dementia using Large Language Models applied to spontaneous speech,” attracted an audience interested in his innovative approaches to dementia diagnosis.

 

CharlaFili25 3

 

Filippo is part of the Ecosystem of Innovation for Next Generation Sardinia project, an initiative aimed at strengthening the connection between science and industry, mitigating the social impacts of the crisis and fostering territorial inclusion. This project seeks to boost innovation, facilitate the transfer of technologies to the productive system, and involve local communities in the challenges of sustainable innovation.

In a region like Sardinia, which has one of the highest elderly population rates in Europe, proactive strategies are being implemented to address the challenges associated with this demographic phenomenon.

Filippo explained that his research focuses on developing improved diagnostic tools for Alzheimer's disease (AD), which affects approximately 50 million people worldwide. Through the use of pre-trained large language models (LLMs), his team achieves automatic detection of AD through linguistic analysis applied to datasets such as ADReSS and ADReSSo (Alzheimer's Dementia Recognition through Spontaneous Speech). This process is performed after converting speech to text, which allows the identification of relevant patterns in patients' spontaneous speech.

 

CharlaFili25 4

 

While recent advances in LLMs offer a promising basis for their application in healthcare, Filippo stressed the importance of optimizing these models for specific tasks such as AD detection, seeking a balance between performance and computational efficiency. To address data privacy concerns in the healthcare sector, his methodology is implemented on affordable hardware, which facilitates local data processing without compromising security.

Filippo's talk provided attendees with valuable insight into how artificial intelligence and language modeling can revolutionize dementia diagnosis and improve early detection of the disease, thus contributing to a healthier future for older generations.

 

  CharlaFili25 5

 

For more information on Filippo Casu's research: https://doi.org/10.1109/PDP62718.2024.00046

This work has been developed within the framework of the project e.INS- Ecosystem of Innovation for Next Generation Sardinia (cod. ECS 00000038) funded by the Italian Ministry for Research and Education (MUR) under the National Recovery and Resilience Plan (NRRP) - MISSION 4 COMPONENT 2, ”From research to business” INVESTMENT 1.5, ”Creation and strengthening of Ecosystems of innovation” and construction of ”Territorial R&D Leaders”.