Search Results for author: Diandian Guo

Found 5 papers, 2 papers with code

Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms

no code implementations • 14 Apr 2024 • Diandian Guo, Manxi Lin, Jialun Pei, He Tang, Yueming Jin, Pheng-Ann Heng

A comprehensive understanding of surgical scenes allows for monitoring of the surgical process, reducing the occurrence of accidents and enhancing efficiency for medical professionals.

Graph Generation Scene Graph Generation

Paper
Add Code

S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR

no code implementations • 22 Feb 2024 • Jialun Pei, Diandian Guo, Jingyang Zhang, Manxi Lin, Yueming Jin, Pheng-Ann Heng

In this study, we introduce a novel single-stage bimodal transformer framework for SGG in the OR, termed S^2Former-OR, aimed to complementally leverage multi-view 2D scenes and 3D point clouds for SGG in an end-to-end manner.

Graph Generation object-detection +3

Paper
Add Code

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

1 code implementation • 27 Jan 2024 • Diandian Guo, Deng-Ping Fan, Tongyu Lu, Christos Sakaridis, Luc van Gool

The estimation of implicit cross-frame correspondences and the high computational cost have long been major challenges in video semantic segmentation (VSS) for driving scenes.

Motion Estimation Segmentation +2

Paper
Code

A Semi-Paired Approach For Label-to-Image Translation

no code implementations • 23 Jun 2023 • George Eskandar, Shuai Zhang, Mohamed Abdelsamad, Mark Youssef, Diandian Guo, Bin Yang

Data efficiency, or the ability to generalize from a few labeled data, remains a major challenge in deep learning.

Image-to-Image Translation Translation

Paper
Add Code

Towards Pragmatic Semantic Image Synthesis for Urban Scenes

1 code implementation • 16 May 2023 • George Eskandar, Diandian Guo, Karim Guirguis, Bin Yang

Second, in contrast to previous works which employ one discriminator that overfits the target domain semantic distribution, we employ a discriminator for the whole image and multiscale discriminators on the image patches.

Autonomous Driving Image Generation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.