6G-XR

 

Research  

 

GTI Data   

 

Open databases created and software developed by the GTI and supplemental material to papers.  

 

Databases  


SportCLIP (2025): Multi-sport dataset for text-guided video summarization.
Ficosa (2024):
The FNTVD dataset has been generated using the Ficosa's recording car.
MATDAT (2023):  More than 90K labeled images of martial arts tricking.
SEAW – DATASET (2022): 3 stereoscopic contents in 4K resolution at 30 fps.
UPM-GTI-Face dataset (2022): 11 different subjects captured in 4K, under 2 scenarios, and 2 face mask conditions.
LaSoDa (2022): 60 annotated images from soccer matches in five stadiums with different characteristics and light conditions.
PIROPO Database (2021):People in Indoor ROoms with Perspective and Omnidirectional cameras.
EVENT-CLASS (2021): High-quality 360-degree videos in the context of tele-education.
Parking Lot Occupancy Database (2020)
Nighttime Vehicle Detection database (NVD) (2019)
Hand gesture dataset (2019): Multi-modal Leap Motion dataset for Hand Gesture Recognition.
ViCoCoS-3D (2016): VideoConference Common Scenes in 3D.
LASIESTA database (2016): More than 20 sequences to test moving object detection and tracking algorithms.
Hand gesture database (2015): Hand-gesture database composed by high-resolution color images acquired with the Senz3D sensor.
HRRFaceD database (2014):Face database composed by high resolution images acquired with Microsoft Kinect 2 (second generation).
Lab database (2012): Set of 6 sequences to test moving object detection strategies.
Vehicle image database (2012)More than 7000 images of vehicles and roads.           

 

Software  


NaviFormer (2025): A Deep Reinforcement Learning Transformer-like Model to Holistically Solve the Navigation Problem.
Empowering Computer Vision in Higher Education(2024)
A Novel Tool for Enhancing Video Coding Comprehension.
Engaging students in audiovisual coding through interactive MATLAB GUIs (2024)

TOP-Former: A Multi-Agent Transformer Approach for the Team Orienteering Problem (2023)

Solving Routing Problems for Multiple Cooperative Unmanned Aerial Vehicles using Transformer Networks (2023)
Vision Transformers and Traditional Convolutional Neural Networks for Face Recognition Tasks (2023)
Faster GSAC-DNN (2023): A Deep Learning Approach to Nighttime Vehicle Detection Using a Fast Grid of Spatial Aware Classifiers.
SETForSeQ (2020): Subjective Evaluation Tool for Foreground Segmentation Quality. 
SMV Player for Oculus Rift (2016)

Bag-D3P (2016): 
Face recognition using depth information. 
TSLAB (2015): 
Tool for Semiautomatic LABeling.   
 

   

Supplementary material  


Viewpoint-Invariant Soccer Pitch Registration Using Geometric and Learned Features (2025)
Soccer line mark segmentation and classification with stochastic watershed transform (2022)
A fully automatic method for segmentation of soccer playing fields (2022)
Grass band detection in soccer images for improved image registration (2022)
Evaluating the Influence of the HMD, Usability, and Fatigue in 360VR Video Quality Assessments (2020)
Automatic soccer field of play registration (2020)   
Augmented reality tool for the situational awareness improvement of UAV operators (2017)
Detection of static moving objects using multiple nonparametric background-foreground models on a Finite State Machine (2015)
Real-time nonparametric background subtraction with tracking-based foreground update (2015)  
Camera localization using trajectories and maps (2014)

 

                                                                                                                                                                                                                             
 
                                                                   
 
                                                                                                                                                             
 
      

 

 

6G-XR: Behind the IMMVIEX experience

Imagine being able to travel without moving. To explore history, art, and culture just by putting on a pair of glasses. This is how IMMVIEX begins — an innovative use case developed within the European project 6G-XR, transforming the way we experience cultural heritage. This immersive journey combines extended reality, 6G connectivity, and volumetric technology to take users inside the Cathedral of Sigüenza without leaving home. A place where distance disappears, presence becomes real, and history comes to life in every gesture — merging technology, emotion, and culture into one experience.

 

 

Behind IMMVIEX lies a powerful 6G infrastructure and strong collaboration between the Universidad Politécnica de Madrid (UPM) and i2CAT. From the 6G-XR testbed in Barcelona, the Image Processing Group (UPM) has deployed and validated an end-to-end XR service that pushed the network to its limits, achieving hundreds of Mbps in uplink throughput and intensive GPU usage through edge computing. The system integrates advanced connectivity over 6G and Edge Computing networks — combining an outdoor 6G cell, a dedicated wired VLAN, and an indoor Wi-Fi network — along with volumetric capture using the Free Viewpoint System and interactive 3D environments recreated in Unity. Thanks to this architecture, IMMVIEX enables a virtual guide to be teleported in real time into a 3D reconstruction of the Cathedral of Sigüenza, interacting with the environment through AI-driven hand gestures.

The result is a successful demonstration that highlights the maturity and robustness of the 6G-XR testbed, showing how next-generation connectivity, volumetric capture, and immersive rendering can revolutionize the way we experience cultural heritage. The videos showcase both the full immersive experience and the technical explanation behind its development, featuring Mario Montagud (i2CAT) and Julián Cabrera (UPM), who share the challenges and achievements of this pioneering project.

IMMVIEX is a window to the future — where technology connects people, places, and cultures beyond physical boundaries.