Virtual Worlds

 

Research  

 

GTI Data   

 

Open databases created and software developed by the GTI and supplemental material to papers.  

 

Databases  


SportCLIP (2025): Multi-sport dataset for text-guided video summarization.
Ficosa (2024):
The FNTVD dataset has been generated using the Ficosa's recording car.
MATDAT (2023):  More than 90K labeled images of martial arts tricking.
SEAW – DATASET (2022): 3 stereoscopic contents in 4K resolution at 30 fps.
UPM-GTI-Face dataset (2022): 11 different subjects captured in 4K, under 2 scenarios, and 2 face mask conditions.
LaSoDa (2022): 60 annotated images from soccer matches in five stadiums with different characteristics and light conditions.
PIROPO Database (2021):People in Indoor ROoms with Perspective and Omnidirectional cameras.
EVENT-CLASS (2021): High-quality 360-degree videos in the context of tele-education.
Parking Lot Occupancy Database (2020)
Nighttime Vehicle Detection database (NVD) (2019)
Hand gesture dataset (2019): Multi-modal Leap Motion dataset for Hand Gesture Recognition.
ViCoCoS-3D (2016): VideoConference Common Scenes in 3D.
LASIESTA database (2016): More than 20 sequences to test moving object detection and tracking algorithms.
Hand gesture database (2015): Hand-gesture database composed by high-resolution color images acquired with the Senz3D sensor.
HRRFaceD database (2014):Face database composed by high resolution images acquired with Microsoft Kinect 2 (second generation).
Lab database (2012): Set of 6 sequences to test moving object detection strategies.
Vehicle image database (2012)More than 7000 images of vehicles and roads.           

 

Software  


NaviFormer (2025): A Deep Reinforcement Learning Transformer-like Model to Holistically Solve the Navigation Problem.
Empowering Computer Vision in Higher Education(2024)
A Novel Tool for Enhancing Video Coding Comprehension.
Engaging students in audiovisual coding through interactive MATLAB GUIs (2024)

TOP-Former: A Multi-Agent Transformer Approach for the Team Orienteering Problem (2023)

Solving Routing Problems for Multiple Cooperative Unmanned Aerial Vehicles using Transformer Networks (2023)
Vision Transformers and Traditional Convolutional Neural Networks for Face Recognition Tasks (2023)
Faster GSAC-DNN (2023): A Deep Learning Approach to Nighttime Vehicle Detection Using a Fast Grid of Spatial Aware Classifiers.
SETForSeQ (2020): Subjective Evaluation Tool for Foreground Segmentation Quality. 
SMV Player for Oculus Rift (2016)

Bag-D3P (2016): 
Face recognition using depth information. 
TSLAB (2015): 
Tool for Semiautomatic LABeling.   
 

   

Supplementary material  


Viewpoint-Invariant Soccer Pitch Registration Using Geometric and Learned Features (2025)
Soccer line mark segmentation and classification with stochastic watershed transform (2022)
A fully automatic method for segmentation of soccer playing fields (2022)
Grass band detection in soccer images for improved image registration (2022)
Evaluating the Influence of the HMD, Usability, and Fatigue in 360VR Video Quality Assessments (2020)
Automatic soccer field of play registration (2020)   
Augmented reality tool for the situational awareness improvement of UAV operators (2017)
Detection of static moving objects using multiple nonparametric background-foreground models on a Finite State Machine (2015)
Real-time nonparametric background subtraction with tracking-based foreground update (2015)  
Camera localization using trajectories and maps (2014)

 

                                                                                                                                                                                                                             
 
                                                                   
 
                                                                                                                                                             
 
      

 

 

Virtual Worlds 

The future "Virtual Worlds" partnership is being set up within the Cluster 4 Digital, Industry and Space of the Horizon Europe R&D&I Program. This partnership, "Virtual Worlds", will foreseeably be one of the fundamental pillars of the strategy launched in 2023 by the European Commission to lead technological development in the so-called virtual worlds and web 4.0, as it is included in the Strategic Plan 2025-27 of Horizon Europe.

The first meeting of the institutions interested in the "Virtual Worlds" partnership was held on June 11 at the Escuela Técnica Superior de Ingenieros de Telecomunicación (ETSIT) of the Universidad Politécnica de Madrid (UPM). The meeting, organized by the Centro para el Desarrollo Tecnológico Industial (CDTI), included presentations of the AMETIC report "Metaverse: Technology, Impact and Application", the Virtual Worlds partnership by the European Commission's DG CNECT and the overview of current extended reality technologies. The meeting continued with a round table on the Spanish involvement in the partnership and ended with a visit to relevant laboratories working with extended reality technologies.

The Grupo de Tratamiento de Imágenes (GTI) actively participated in the meeting supporting the organization, contributing to the round table, and showing the activities of two laboratories. Thus, Narciso Garcia shared the round table and presented his vision on the capabilities and needs of the Spanish community potentially involved in this partnership. He remarked the need to have scientific and technological forums to exchange experiences. Thus, GTI has been the promoter of the International Summer School on eXtended Reality Technology and eXperience (XRTX), which was offered as a forum for the partnership.

 

CDT2

 

The visit to the laboratories included two of the GTI laboratories. On the one hand, Julián Cabrera coordinated the visit to the FVV Live (Free Viewpoint Video - Live), endowed with flexible camera configurations and lightweight schemes for video acquisition, transmission and visualization with minimal motion-to-photon latency, where the latest developments in the integration of local and distant video by combining volumetric (local people) and standard (distant video) information were presented. On the other hand, Jesús Gutiérrez coordinated the visit to the ImCoLab (Immersive Communications Laboratory), endowed with extended reality, volumetric video, and 360VR equipment, where extended reality developments for industrial training, extended reality for therapy (specifically batmophobia), and social extended reality applications (Social XR) were presented.

 

CDTI3