Search Results for author: Ngan Le

Found 56 papers, 28 papers with code

S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling

no code implementations • 7 May 2024 • Minh Tran, Adrian de Luis, Haitao Liao, Ying Huang, Roy McCann, Alan Mantooth, Jack Cothren, Ngan Le

To meet this need, we introduce S3Former, designed to segment solar panels from aerial imagery and provide size and location information critical for analyzing the impact of such installations on the grid.

Self-Supervised Learning

CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect

no code implementations • 17 Apr 2024 • Minh Tran, Sang Truong, Arthur F. A. Fernandes, Michael T. Kidd, Ngan Le

This study proposes an effective approach for automating the assessment of carcass quality without requiring skilled labor or inspector involvement.

Defect Detection

Unifying Global and Local Scene Entities Modelling for Precise Action Spotting

1 code implementation • 15 Apr 2024 • Kim Hoang Tran, Phuc Vuong Do, Ngoc Quoc Ly, Ngan Le

However, these approaches tend to overlook the nuances of the scene and struggle with detecting actions that occupy a small portion of the frame.

Action Spotting Avg +1

ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

no code implementations • 18 Mar 2024 • Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le

Consequently, this compromises the quality of visible features during the subsequent visible-to-amodal transition.

Amodal Instance Segmentation Semantic Segmentation

WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

no code implementations • 15 Dec 2023 • Huy Le, Tung Kieu, Anh Nguyen, Ngan Le

Text-video retrieval, a prominent sub-field within the domain of multimodal information retrieval, has witnessed remarkable growth in recent years.

Information Retrieval Knowledge Distillation +3

TSRNet: Simple Framework for Real-time ECG Anomaly Detection with Multimodal Time and Spectrogram Restoration Network

1 code implementation • 15 Dec 2023 • Nhat-Tan Bui, Dinh-Hieu Hoang, Thinh Phan, Minh-Triet Tran, Brijesh Patel, Donald Adjeroh, Ngan Le

As a result, we introduce a specialized network called the Multimodal Time and Spectrogram Restoration Network (TSRNet) designed specifically for detecting anomalies in ECG signals.

Anomaly Detection Time Series

PGS: Pose-Guided Supervision for Mitigating Clothes-Changing in Person Re-Identification

1 code implementation • 9 Dec 2023 • Quoc-Huy Trinh, Nhat-Tan Bui, Dinh-Hieu Hoang, Phuoc-Thao Vo Thi, Hai-Dang Nguyen, Debesh Jha, Ulas Bagci, Ngan Le, Minh-Triet Tran

The Person Re-Identification (Re-ID) task seeks to enhance the tracking of multiple individuals by surveillance cameras.

Clothes Changing Person Re-Identification Person Retrieval +2

ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection

no code implementations • 1 Nov 2023 • Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald Adjeroh, Ngan Le

Temporal action detection (TAD) involves the localization and classification of action instances within untrimmed videos.

Action Detection Classification +3

SolarFormer: Multi-scale Transformer for Solar PV Profiling

no code implementations • 30 Oct 2023 • Adrian de Luis, Minh Tran, Taisei Hanyu, Anh Tran, Liao Haitao, Roy McCann, Alan Mantooth, Ying Huang, Ngan Le

Accurate mapping of PV installations is crucial for understanding their adoption and informing energy policy.

Decoder

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

1 code implementation • 5 Oct 2023 • Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

Open-Fusion harnesses the power of a pre-trained vision-language foundation model (VLFM) for open-set semantic comprehension and employs the Truncated Signed Distance Function (TSDF) for swift 3D scene reconstruction.

3D Scene Reconstruction

I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses

1 code implementation • 24 Sep 2023 • Trong Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le

In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification.

Language Modelling

SAM3D: Segment Anything Model in Volumetric Medical Images

2 code implementations • 7 Sep 2023 • Nhat-Tan Bui, Dinh-Hieu Hoang, Minh-Triet Tran, Gianfranco Doretto, Donald Adjeroh, Brijesh Patel, Arabinda Choudhary, Ngan Le

Image segmentation remains a pivotal component in medical image analysis, aiding in the extraction of critical information for precise diagnostic practices.

Image Segmentation Segmentation +1

MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation

2 code implementations • 6 Sep 2023 • Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le

MEGANet is designed as an end-to-end framework, encompassing three key modules: an encoder, which is responsible for capturing and abstracting the features from the input image, a decoder, which focuses on salient features, and the Edge-Guided Attention module (EGA) that employs the Laplacian Operator to accentuate polyp boundaries.

Decoder Edge Detection +1

ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey

1 code implementation • 9 Jul 2023 • Salman Mohamadi, Ghulam Mujtaba, Ngan Le, Gianfranco Doretto, Donald A. Adjeroh

We also lay out essential foundational literature on LLMs and GAI in general and their connection with ChatGPT.

Language Modelling Large Language Model

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

1 code implementation • 12 Jun 2023 • Kashu Yamazaki, Taisei Hanyu, Minh Tran, Adrian de Luis, Roy McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Jackson Cothren, Ngan Le

Aerial Image Segmentation is semantic segmentation from a top-down perspective and has several challenging characteristics, such as a strong imbalance in the foreground-background distribution, complex backgrounds, intra-class heterogeneity, inter-class homogeneity, and tiny objects.

Ranked #1 on Semantic Segmentation on ISPRS Potsdam

Decoder Image Segmentation +2

Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching

no code implementations • 16 Apr 2023 • Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen

In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images.

Image-to-Image Translation

FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

1 code implementation • CVPR 2023 • Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson Cothren, Khoa Luu

Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed.

Ranked #5 on Domain Adaptation on SYNTHIA-to-Cityscapes

Autonomous Driving Domain Adaptation +4

Open-Vocabulary Affordance Detection in 3D Point Clouds

1 code implementation • 4 Mar 2023 • Toan Nguyen, Minh Nhat Vu, An Vuong, Dzung Nguyen, Thieu Vo, Ngan Le, Anh Nguyen

In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds.

Affordance Detection

DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading

no code implementations • 30 Dec 2022 • Hasan Md Tusfiqur, Duy M. H. Nguyen, Mai T. N. Truong, Triet A. Nguyen, Binh T. Nguyen, Michael Barz, Hans-Juergen Profitlich, Ngoc T. T. Than, Ngan Le, Pengtao Xie, Daniel Sonntag

Diabetic Retinopathy (DR) is a leading cause of vision loss in the world, and early DR detection is necessary to prevent vision loss and support an appropriate treatment.

Diabetic Retinopathy Grading Lesion Segmentation +1

Contextual Explainable Video Representation: Human Perception-based Understanding

1 code implementation • 12 Dec 2022 • Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le

We choose video paragraph captioning and temporal action detection to illustrate the effectiveness of human perception-based contextual representation in video understanding.

Action Detection Action Recognition +4

CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

1 code implementation • 9 Dec 2022 • Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le

Video anomaly detection (VAD) -- commonly formulated as a multiple-instance learning problem in a weakly-supervised manner due to its labor-intensive nature -- is a challenging problem in video surveillance where the frames of anomaly need to be localized in an untrimmed video.

Anomaly Detection Multiple Instance Learning +1

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

1 code implementation • 28 Nov 2022 • Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le

Video paragraph captioning aims to generate a multi-sentence description of an untrimmed video with several temporal event locations in coherent storytelling.

Ranked #2 on Video Captioning on ActivityNet Captions

Sentence Video Captioning

Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach

no code implementations • 17 Nov 2022 • Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Son Lam Phung, Ngan Le, Khoa Luu

The development of autonomous vehicles generates a tremendous demand for a low-cost solution with a complete set of camera sensors capturing the environment around the car.

3D Object Detection Autonomous Vehicles +3

AISFormer: Amodal Instance Segmentation with Transformer

1 code implementation • 12 Oct 2022 • Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

AISFormer explicitly models the complex coherence between occluder, visible, amodal, and invisible masks within an object's regions of interest by treating them as learnable queries.

Amodal Instance Segmentation Decoder +2

EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification

1 code implementation • 7 Oct 2022 • Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le

The transformer expanding path models the temporal coherency between embryo images to ensure monotonic non-decreasing constraint and is optimized by a segmentation head.

Decoder

AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

1 code implementation • 5 Oct 2022 • Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le

The PMR module represents each video snippet by a visual-linguistic feature, in which the main actors and the surrounding environment are represented by visual information, whereas relevant objects are depicted by linguistic features through an image-text model.

Ranked #1 on Temporal Action Proposal Generation on ActivityNet-1.3

Action Detection Temporal Action Proposal Generation

Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning

1 code implementation • 30 Sep 2022 • Thinh Phan, Duc Le, Patel Brijesh, Donald Adjeroh, Jingxian Wu, Morten Olgaard Jensen, Ngan Le

The electrocardiogram (ECG) signal is one of the most effective sources of information, mainly employed for the diagnosis and prediction of cardiovascular diseases (CVDs) connected with abnormalities in heart rhythm.

ECG Classification Self-Knowledge Distillation +3

Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition

no code implementations • 11 Sep 2022 • Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Marios Savvides, Khoa Luu

We therefore introduce a new method named Attention-based Bijective Generative Adversarial Networks in a Distillation framework (DAB-GAN) to synthesize faces of a subject given his/her extracted face recognition features.

Face Recognition Face Reconstruction +2

VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

1 code implementation • 26 Jun 2022 • Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

In this paper, we leverage the human perceiving process, which involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos.

Ranked #3 on Video Captioning on ActivityNet Captions

Contrastive Learning Video Captioning

Self-supervised Domain Adaptation in Crowd Counting

no code implementations • 7 Jun 2022 • Pha Nguyen, Thanh-Dat Truong, Miaoqing Huang, Yi Liang, Ngan Le, Khoa Luu

Self-training crowd counting has not been thoroughly explored, though it is one of the important challenges in computer vision.

Crowd Counting Domain Adaptation

OTAdapt: Optimal Transport-based Approach For Unsupervised Domain Adaptation

no code implementations • 22 May 2022 • Thanh-Dat Truong, Naga Venkata Sai Raviteja Chappa, Xuan Bac Nguyen, Ngan Le, Ashley Dowling, Khoa Luu

Unsupervised domain adaptation is one of the challenging problems in computer vision.

Object Recognition Unsupervised Domain Adaptation

Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles

no code implementations • 19 Apr 2022 • Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Ngan Le, Xuan-Bac Nguyen, Khoa Luu

The experimental results on the nuScenes dataset demonstrate the benefits of the proposed method to produce SOTA performance on the existing vision-based tracking dataset.

3D Object Detection 3D Object Tracking +5

CapsNet for Medical Image Segmentation

no code implementations • 16 Mar 2022 • Minh Tran, Viet-Khoa Vo-Ho, Kyle Quinn, Hien Nguyen, Khoa Luu, Ngan Le

We then provide recent developments of CapsNet for the task of medical image segmentation.

Image Segmentation Representation Learning +3

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications

no code implementations • 16 Mar 2022 • Viet-Khoa Vo-Ho, Kashu Yamazaki, Hieu Hoang, Minh-Triet Tran, Ngan Le

To address such limitations, meta-learning has been adopted in the scenarios of few-shot learning and multiple tasks.

Few-Shot Learning Image Classification +1

ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation

1 code implementation • 16 Mar 2022 • Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le

Temporal action proposal generation (TAPG) aims to estimate temporal intervals of actions in untrimmed videos, which is challenging yet plays an important role in many tasks of video analysis and understanding.

Ranked #4 on Temporal Action Proposal Generation on ActivityNet-1.3

Action Detection Temporal Action Proposal Generation

Point-Unet: A Context-aware Point-based Neural Network for Volumetric Segmentation

1 code implementation • 16 Mar 2022 • Ngoc-Vuong Ho, Tan Nguyen, Gia-Han Diep, Ngan Le, Binh-Son Hua

In this paper, we propose Point-Unet, a novel method that incorporates the efficiency of deep learning with 3D point clouds into volumetric segmentation.

Image Segmentation Medical Image Segmentation +2

3D-UCaps: 3D Capsules Unet for Volumetric Image Segmentation

2 code implementations • 16 Mar 2022 • Tan Nguyen, Binh-Son Hua, Ngan Le

Medical image segmentation has so far achieved promising results with Convolutional Neural Networks (CNNs).

Hippocampus Image Segmentation +3

SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical Segmentation on Less Labeled Data

no code implementations • 15 Jan 2022 • Minh Tran, Loi Ly, Binh-Son Hua, Ngan Le

The capsule network is a recent deep network architecture that has been applied successfully to medical image segmentation tasks.

Decoder Hippocampus +5

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

1 code implementation • 21 Oct 2021 • Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le

In this paper, we attempt to simulate that human ability by proposing the Actor Environment Interaction (AEI) network to improve the video representation for temporal action proposals generation.

Ranked #2 on Temporal Action Proposal Generation on ActivityNet-1.3

Action Detection Temporal Action Proposal Generation

Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey

no code implementations • 25 Aug 2021 • Ngan Le, Vidhiwar Singh Rathour, Kashu Yamazaki, Khoa Luu, Marios Savvides

In this work, we provide a detailed review of recent and state-of-the-art research advances of deep reinforcement learning in computer vision.

Image Segmentation object-detection +5

The Right to Talk: An Audio-Visual Transformer Approach

1 code implementation • ICCV 2021 • Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu

Therefore, this work introduces a new Audio-Visual Transformer approach to the problem of localizing and highlighting the main speaker in both audio and visual channels of a multi-speaker conversation video in the wild.

BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation

1 code implementation • ICCV 2021 • Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu

Semantic segmentation aims to predict pixel-level labels.

Ranked #19 on Unsupervised Domain Adaptation on GTAV-to-Cityscapes Labels

Scene Segmentation Segmentation +1

Multi-module Recurrent Convolutional Neural Network with Transformer Encoder for ECG Arrhythmia Classification

1 code implementation • IEEE EMBS 2021 • Minh Duc Le, Vidhiwar Singh Rathour, Quang Sang Truong, Quan Mai, Patel Brijesh, Ngan Le

The automatic classification of electrocardiogram (ECG) signals has played an important role in cardiovascular disease diagnosis and prediction.

Time Series Time Series Analysis

Agent-Environment Network for Temporal Action Proposal Generation

no code implementations • 17 Jul 2021 • Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran

Temporal action proposal generation is an essential and challenging task that aims at localizing temporal intervals containing human actions in untrimmed videos.

Temporal Action Proposal Generation

Offset Curves Loss for Imbalanced Problem in Medical Segmentation

no code implementations • 4 Dec 2020 • Ngan Le, Trung Le, Kashu Yamazaki, Toan Duc Bui, Khoa Luu, Marios Savvides

Our proposed Offset Curves (OsC) loss consists of three main fitting terms.

Image Segmentation Medical Image Segmentation +2

A Multi-task Contextual Atrous Residual Network for Brain Tumor Detection & Segmentation

no code implementations • 3 Dec 2020 • Ngan Le, Kashu Yamazaki, Dat Truong, Kha Gia Quach, Marios Savvides

The first objective is performed by our proposed contextual brain tumor detection network, which plays the role of an attention gate and focuses only on the region around the brain tumor while ignoring the distant background, which is less correlated with the tumor.

Brain Tumor Segmentation Tumor Segmentation

Flow-based Deformation Guidance for Unpaired Multi-Contrast MRI Image-to-Image Translation

no code implementations • 3 Dec 2020 • Toan Duc Bui, Manh Nguyen, Ngan Le, Khoa Luu

To capture temporal structures in the medical images, we explore the displacement between the consecutive slices using a deformation field.

Generative Adversarial Network Image-to-Image Translation +1

LIAAD: Lightweight Attentive Angular Distillation for Large-scale Age-Invariant Face Recognition

no code implementations • 9 Apr 2020 • Thanh-Dat Truong, Chi Nhan Duong, Kha Gia Quach, Ngan Le, Tien D. Bui, Khoa Luu

This work presents a novel Lightweight Attentive Angular Distillation (LIAAD) approach to Large-scale Lightweight AiFR that overcomes these limitations.

Age-Invariant Face Recognition

Domain Generalization via Universal Non-volume Preserving Models

no code implementations • 28 May 2019 • Thanh-Dat Truong, Chi Nhan Duong, Khoa Luu, Minh-Triet Tran, Ngan Le

However, it has been largely overlooked in the problem of recognition in new unseen domains.

Domain Generalization Face Recognition +2

Image Alignment in Unseen Domains via Domain Deep Generalization

no code implementations • 28 May 2019 • Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

This paper presents a novel deep learning-based approach to tackle the problem of image alignment across unseen modalities.

Domain Adaptation

ShrinkTeaNet: Million-scale Lightweight Face Recognition via Shrinking Teacher-Student Networks

2 code implementations • 25 May 2019 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Ngan Le

In addition, this work introduces a novel Angular Distillation Loss for distilling the feature direction and the sample distributions of the teacher's hypersphere to its student.

Lightweight Face Recognition

Fast Flow Reconstruction via Robust Invertible nxn Convolution

no code implementations • 24 May 2019 • Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

The experiments on the CIFAR-10, ImageNet and Celeb-HQ datasets have shown that our invertible $n \times n$ convolution helps to improve the performance of generative models significantly.

Non-Volume Preserving-based Fusion to Group-Level Emotion Recognition on Crowd Videos

no code implementations • 28 Nov 2018 • Kha Gia Quach, Ngan Le, Chi Nhan Duong, Ibsa Jalata, Kaushik Roy, Khoa Luu

To demonstrate the robustness and effectiveness of each component in the proposed approach, three experiments were conducted: (i) evaluation on the AffectNet database to benchmark the proposed EmoNet for recognizing facial expression; (ii) evaluation on EmotiW2018 to benchmark the proposed deep feature level fusion mechanism NVPF; and (iii) examination of the proposed TNVPF on an innovative Group-level Emotion on Crowd Videos (GECV) dataset composed of 627 videos collected from publicly available sources.

Emotion Recognition

Automatic Face Aging in Videos via Deep Reinforcement Learning

no code implementations • CVPR 2019 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Nghia Nguyen, Eric Patterson, Tien D. Bui, Ngan Le

This paper presents a novel approach to automatically synthesize age-progressed facial images in video sequences using Deep Reinforcement Learning.

Face Verification reinforcement-learning +1

MobiFace: A Lightweight Deep Learning Face Recognition on Mobile Devices

no code implementations • 27 Nov 2018 • Chi Nhan Duong, Kha Gia Quach, Ibsa Jalata, Ngan Le, Khoa Luu

Deep neural networks have been widely used in numerous computer vision applications, particularly in face recognition.

Face Recognition

Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation

1 code implementation • 12 Apr 2017 • Ngan Le, Kha Gia Quach, Khoa Luu, Marios Savvides, Chenchen Zhu

To address these issues and boost the classic variational LS methods to a new level of the learnable deep learning approaches, we propose a novel definition of contour evolution named Recurrent Level Set (RLS) to employ Gated Recurrent Unit under the energy minimization of a variational LS functional.

Segmentation Semantic Segmentation
