Communication-Aware Implicit Neural Fields for Outdoor LiDAR Scene Reconstruction

dc.contributor.authorAbrar, S M Rifaiya
dc.contributor.facultyfi=Tekniikan ja innovaatiojohtamisen yksikkö|en=School of Technology and Innovations|
dc.contributor.organizationfi=Vaasan yliopisto|en=University of Vaasa|
dc.date.accessioned2026-06-18T07:54:32Z
dc.date.issued2026-06-05
dc.description.abstractThe problem of transmission and reconstruction of 3D geometry over wireless channels is an important one in autonomous systems and robotics, where geometry plays an important role, and bandwidth is a critical resource. The current methods for modelling geometry typically represent it as a list of 3D points, which is challenging to maintain continuous surface structure through downsampling to send geometry. The effects are a loss of spatial detail and unevenly reconstructed scenes particularly in outdoor environments with large areas to deal with. This thesis presents a model that combines implicit neural distance fields with a multi-scale latent representation which is built for wireless transmission. One recent model, called LightNDF, is a lightweight implicit neural field model that reconstructs continuous 3D geometry from voxel occupancy grids using multi-scale CNN features, but was not intended for any kind of transmission function. This work adopts LightNDF as its backbone and significantly extends it for communication-aware deployment. The main contribution is the introduction of a structured latent pyramid which compresses the scene into three spatial pyramids of different resolution. Due to this, the transmission size is decreased from about 218 MB per sample to 0.57 MB. Joint source-channel coding is directly operated in the latent space, and their combination of a bottleneck channel encoder, occupancy-aware masking, a residual cross-scale coding, and an SNR-adaptive gating model, are all beneficial to robustness in the presence of channel noise. Multi-scale features are reconstructed through query based decoding into unsigned distance value at randomized location. Then, outputs are analysed within the SHINE-Mapping pipeline, measuring the spatial consistency between frames. These experiments are performed on both KITTI and NewerCollege datasets with SNR of 5, 10 and 20 dB, and it can be observed that the proposed model achieves a more stable reconstruction with respect to SEPT, a state-of-the-art wireless point cloud transmission model. Unlike SEPT which achieves better compression by using a single global latent vector, the proposed model maintains the spatial structures across scales and has a better tolerance to channel noise. Results indicate that in the context of mapping and robotics, a spatially structured latent representation is superior to compact single-vector compression despite having a slightly higher transmission cost.
dc.description.notificationfi=Opinnäytetyö kokotekstinä PDF-muodossa.|en=Thesis fulltext in PDF format.|sv=Lärdomsprov tillgängligt som fulltext i PDF-format|
dc.format.contentfi=kokoteksti|en=fulltext|
dc.format.extent76
dc.identifier.urihttps://osuva.uwasa.fi/handle/11111/20981
dc.identifier.urnURN:NBN:fi-fe2026060564357
dc.language.isoeng
dc.rightsCC BY-NC 4.0
dc.subject.degreeprogrammeMaster's Programme in Sustainable and Autonomus Systems (SAS)
dc.subject.disciplineSustainable and Autonomous Systems
dc.subject.ysolidar
dc.subject.ysowireless data transmission
dc.subject.ysomodelling (representation)
dc.subject.ysodeep learning
dc.subject.ysoneural networks (information technology)
dc.subject.ysomachine learning
dc.subject.ysosignal processing
dc.subject.ysorobotics
dc.subject.ysocomputer vision
dc.titleCommunication-Aware Implicit Neural Fields for Outdoor LiDAR Scene Reconstruction
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|sv=Pro gradu -avhandling|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Uwasa_2026_Abrar_S_M_Rifaiya.pdf
Size:
2.77 MB
Format:
Adobe Portable Document Format