Search Results for author: Tat-Seng Chua

Found 251 papers, 156 papers with code

Learning to Imagine: Integrating Counterfactual Thinking in Neural Discrete Reasoning

no code implementations • ACL 2022 • Moxin Li, Fuli Feng, Hanwang Zhang, Xiangnan He, Fengbin Zhu, Tat-Seng Chua

Neural discrete reasoning (NDR) has shown remarkable progress in combining deep models with discrete reasoning.

Paper
Add Code

Re-examining the Role of Schema Linking in Text-to-SQL

no code implementations • EMNLP 2020 • Wenqiang Lei, Weixin Wang, Zhixin Ma, Tian Gan, Wei Lu, Min-Yen Kan, Tat-Seng Chua

By providing a schema linking corpus based on the Spider text-to-SQL dataset, we systematically study the role of schema linking.

Text-To-SQL

Paper
Add Code

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

1 code implementation • 27 May 2024 • Tianyu Yu, Haoye Zhang, Yuan YAO, Yunkai Dang, Da Chen, Xiaoman Lu, Ganqu Cui, Taiwen He, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun

While traditional methods rely on labor-intensive and time-consuming manual labeling, recent approaches employing models as automatic labelers have shown promising results without human intervention.

Paper
Code

ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining

1 code implementation • 23 May 2024 • Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua

To resolve the challenges above, we propose a new pretraining method, ReactXT, for reaction-text modeling, and a new dataset, OpenExp, for experimental procedure prediction.

Molecule Captioning Retrosynthesis

Paper
Code

ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation

no code implementations • 23 May 2024 • Jingnan Zheng, Han Wang, An Zhang, Tai D. Nguyen, Jun Sun, Tat-Seng Chua

Systematic analysis also validates that the generated test scenarios represent meaningful use cases, as well as integrate enhanced measures to probe long-tail risks.

Paper
Add Code

Towards Comprehensive and Efficient Post Safety Alignment of Large Language Models via Safety Patching

no code implementations • 22 May 2024 • Weixiang Zhao, Yulin Hu, Zhuojun Li, Yang Deng, Yanyan Zhao, Bing Qin, Tat-Seng Chua

Safety alignment of large language models (LLMs) has been gaining increasing attention.

Paper
Add Code

ProtT3: Protein-to-Text Generation for Text-based Protein Understanding

1 code implementation • 21 May 2024 • Zhiyuan Liu, An Zhang, Hao Fei, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua

ProtT3 empowers an LM to understand protein sequences of amino acids by incorporating a PLM as its protein understanding module, enabling effective protein-to-text generation.

Property Prediction Question Answering +3

Paper
Code

STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents

no code implementations • 20 May 2024 • Yue Chen, Chen Huang, Yang Deng, Wenqiang Lei, dingnan jin, Jia Liu, Tat-Seng Chua

However, they still struggle to deliver promising performance on unseen domains, struggling to achieve effective domain transferability.

Conversational Search Language Modelling +1

Paper
Add Code

CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

1 code implementation • 20 May 2024 • Tong Zhang, Peixin Qin, Yang Deng, Chen Huang, Wenqiang Lei, Junhong Liu, dingnan jin, Hongru Liang, Tat-Seng Chua

To this end, we introduce CLAMBER, a benchmark for evaluating LLMs using a well-organized taxonomy.

Paper
Code

Co-Matching: Towards Human-Machine Collaborative Legal Case Matching

no code implementations • 16 May 2024 • Chen Huang, Xinwei Yang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

However, successful legal case matching requires the tacit knowledge of legal practitioners, which is difficult to verbalize and encode into machines.

Paper
Add Code

Learnable Tokenizer for LLM-based Generative Recommendation

no code implementations • 12 May 2024 • Wenjie Wang, Honghui Bao, Xinyu Lin, Jizhi Zhang, Yongqi Li, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

To address these shortcomings, we propose LETTER (a LEarnable Tokenizer for generaTivE Recommendation), designed to meet the key criteria of identifiers by integrating hierarchical semantics, collaborative signals, and code assignment diversity.

World Knowledge

Paper
Add Code

A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models

no code implementations • 10 May 2024 • Yujuan Ding, Wenqi Fan, Liangbo Ning, Shijie Wang, Hengyun Li, Dawei Yin, Tat-Seng Chua, Qing Li

Given the powerful abilities of RAG in providing the latest and helpful auxiliary information, retrieval-augmented large language models have emerged to harness external and authoritative knowledge bases, rather than solely relying on the model's internal knowledge, to augment the generation quality of LLMs.

Information Retrieval Retrieval

Paper
Add Code

Auto-Encoding Morph-Tokens for Multimodal LLM

1 code implementation • 3 May 2024 • Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang

For multimodal LLMs, the synergy of visual comprehension (textual output) and generation (visual output) presents an ongoing challenge.

Image Reconstruction MORPH

Paper
Code

A Taxation Perspective for Fair Re-ranking

1 code implementation • 27 Apr 2024 • Chen Xu, Xiaopeng Ye, Wenjie Wang, Liang Pang, Jun Xu, Tat-Seng Chua

From a taxation perspective, we theoretically demonstrate that most previous fair re-ranking methods can be reformulated as an item-level tax policy.

Ethics Fairness +1

Paper
Code

A Survey of Generative Search and Recommendation in the Era of Large Language Models

no code implementations • 25 Apr 2024 • Yongqi Li, Xinyu Lin, Wenjie Wang, Fuli Feng, Liang Pang, Wenjie Li, Liqiang Nie, Xiangnan He, Tat-Seng Chua

With the information explosion on the Web, search and recommendation are foundational infrastructures to satisfying users' information needs.

Paper
Add Code

Towards Human-centered Proactive Conversational Agents

no code implementations • 19 Apr 2024 • Yang Deng, Lizi Liao, Zhonghua Zheng, Grace Hui Yang, Tat-Seng Chua

Recent research on proactive conversational agents (PCAs) mainly focuses on improving the system's capabilities in anticipating and planning action sequences to accomplish tasks and achieve goals before users articulate their requests.

Information Retrieval Retrieval

Paper
Add Code

Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

no code implementations • 17 Apr 2024 • Minghe Gao, Shuang Chen, Liang Pang, Yuan YAO, Jisheng Dang, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang, Tat-Seng Chua

Their ability to execute intricate compositional reasoning tasks is also constrained, culminating in a stagnation of learning progression for these models.

Hallucination

Paper
Add Code

Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

1 code implementation • 4 Apr 2024 • Chen Huang, Peixin Qin, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia.

Recommendation Systems

Paper
Code

CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling

no code implementations • 2 Apr 2024 • Yunshan Ma, Yingzhi He, Wenjun Zhong, Xiang Wang, Roger Zimmermann, Tat-Seng Chua

However, the cross-item relations have been under-explored in the current multimodal pre-train models.

Graph Learning Marketing +2

Paper
Add Code

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning

no code implementations • 22 Mar 2024 • Changmeng Zheng, Dayong Liang, WengYu Zhang, Xiao-Yong Wei, Tat-Seng Chua, Qing Li

The study addresses two key challenges: the trivialization of opinions resulting from excessive summarization and the diversion of focus caused by distractor concepts introduced from images.

Multimodal Reasoning

Paper
Add Code

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

3 code implementations • 18 Mar 2024 • Ruyi Xu, Yuan YAO, Zonghao Guo, Junbo Cui, Zanlin Ni, Chunjiang Ge, Tat-Seng Chua, Zhiyuan Liu, Maosong Sun, Gao Huang

To address the challenges, we present LLaVA-UHD, a large multimodal model that can efficiently perceive images in any aspect ratio and high resolution.

4,017

Paper
Code

Think Twice Before Assure: Confidence Estimation for Large Language Models through Reflection on Multiple Answers

no code implementations • 15 Mar 2024 • Moxin Li, Wenjie Wang, Fuli Feng, Fengbin Zhu, Qifan Wang, Tat-Seng Chua

Confidence estimation aiming to evaluate output trustability is crucial for the application of large language models (LLM), especially the black-box ones.

Paper
Add Code

FashionReGen: LLM-Empowered Fashion Report Generation

no code implementations • 11 Mar 2024 • Yujuan Ding, Yunshan Ma, Wenqi Fan, Yige Yao, Tat-Seng Chua, Qing Li

Fashion analysis refers to the process of examining and evaluating trends, styles, and elements within the fashion industry to understand and interpret its current state, generating fashion reports.

Paper
Add Code

Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning

no code implementations • 11 Mar 2024 • Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua

We investigate non-collaborative dialogue agents, which are expected to engage in strategic conversations with diverse users, for securing a mutual agreement that leans favorably towards the system's objectives.

Paper
Add Code

Discriminative Probing and Tuning for Text-to-Image Generation

no code implementations • 7 Mar 2024 • Leigang Qu, Wenjie Wang, Yongqi Li, Hanwang Zhang, Liqiang Nie, Tat-Seng Chua

We present a discriminative adapter built on T2I models to probe their discriminative abilities on two representative tasks and leverage discriminative fine-tuning to improve their text-image alignment.

Text-to-Image Generation

Paper
Add Code

Contrastive Pre-training for Deep Session Data Understanding

no code implementations • 5 Mar 2024 • Zixuan Li, Lizi Liao, Yunshan Ma, Tat-Seng Chua

In this work, we delve into deep session data understanding via scrutinizing the various clues inside the rich information in user sessions.

Contrastive Learning

Paper
Add Code

Uplift Modeling for Target User Attacks on Recommender Systems

1 code implementation • 5 Mar 2024 • Wenjie Wang, Changsheng Wang, Fuli Feng, Wentao Shi, Daizong Ding, Tat-Seng Chua

UBA estimates the treatment effect on each target user and optimizes the allocation of fake user budgets to maximize the attack performance.

Recommendation Systems

Paper
Code

Learning to Ask Critical Questions for Assisting Product Search

no code implementations • 5 Mar 2024 • Zixuan Li, Lizi Liao, Tat-Seng Chua

In this paper, we propose a dual-learning model that hybrids the best from both implicit session feedback and proactively clarifying with users on the most critical questions.

Information Retrieval

Paper
Add Code

Abductive Ego-View Accident Video Understanding for Safe Driving Perception

no code implementations • 1 Mar 2024 • Jianwu Fang, Lei-Lei Li, Junfei Zhou, Junbin Xiao, Hongkai Yu, Chen Lv, Jianru Xue, Tat-Seng Chua

This model involves a contrastive interaction loss to learn the pair co-occurrence of normal, near-accident, accident frames with the corresponding text descriptions, such as accident reasons, prevention advice, and accident categories.

Object object-detection +3

Paper
Add Code

A Survey on Neural Question Generation: Methods, Applications, and Prospects

no code implementations • 28 Feb 2024 • Shasha Guo, Lizi Liao, Cuiping Li, Tat-Seng Chua

In this survey, we present a detailed examination of the advancements in Neural Question Generation (NQG), a field leveraging neural network techniques to generate relevant questions from diverse inputs like knowledge bases, texts, and images.

Question Generation Question-Generation

Paper
Add Code

Prospect Personalized Recommendation on Large Language Model-based Agent Platform

1 code implementation • 28 Feb 2024 • Jizhi Zhang, Keqin Bao, Wenjie Wang, Yang Zhang, Wentao Shi, Wanhong Xu, Fuli Feng, Tat-Seng Chua

Additionally, we prospect the evolution of Rec4Agentverse and conceptualize it into three stages based on the enhancement of the interaction and information exchange among Agent Items, Agent Recommender, and the user.

Language Modelling Large Language Model +1

Paper
Code

GraphEdit: Large Language Models for Graph Structure Learning

1 code implementation • 23 Feb 2024 • Zirui Guo, Lianghao Xia, Yanhua Yu, Yuling Wang, Zixuan Yang, Wei Wei, Liang Pang, Tat-Seng Chua, Chao Huang

Graph Structure Learning (GSL) focuses on capturing intrinsic dependencies and interactions among nodes in graph-structured data by generating novel graph structures.

Graph structure learning

Paper
Code

On the Multi-turn Instruction Following for Conversational Web Agents

1 code implementation • 23 Feb 2024 • Yang Deng, Xuan Zhang, Wenxuan Zhang, Yifei Yuan, See-Kiong Ng, Tat-Seng Chua

Web agents powered by Large Language Models (LLMs) have demonstrated remarkable abilities in planning and executing multi-step interactions within complex web-based environments, fulfilling a wide range of web navigation tasks.

Conversational Web Navigation Instruction Following

Paper
Code

Gotcha! Don't trick me with unanswerable questions! Self-aligning Large Language Models for Responding to Unknown Questions

no code implementations • 23 Feb 2024 • Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua

Despite the remarkable abilities of Large Language Models (LLMs) to answer questions, they often display a considerable level of overconfidence even when the question does not have a definitive answer.

Paper
Add Code

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning

1 code implementation • 18 Feb 2024 • Long Qian, Juncheng Li, Yu Wu, Yaobo Ye, Hao Fei, Tat-Seng Chua, Yueting Zhuang, Siliang Tang

Large Language Models (LLMs) demonstrate remarkable proficiency in comprehending and handling text-based tasks.

Language Modelling Large Language Model

Paper
Code

Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond

no code implementations • 16 Feb 2024 • Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chua

Building upon this capability, we propose to enable multimodal large language models (MLLMs) to memorize and recall images within their parameters.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Distillation Enhanced Generative Retrieval

no code implementations • 16 Feb 2024 • Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, Tat-Seng Chua

Generative retrieval is a promising new paradigm in text retrieval that generates identifier strings of relevant passages as the retrieval target.

Retrieval Text Retrieval

Paper
Add Code

LLM-based Federated Recommendation

no code implementations • 15 Feb 2024 • Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-Kiong Ng, Tat-Seng Chua

Nevertheless, applying Fed4Rec to LLM-based recommendation presents two main challenges: first, an increase in the imbalance of performance across clients, affecting the system's efficiency over time, and second, a high demand on clients' computational and storage resources for local training and inference of LLMs.

Federated Learning Language Modelling +2

Paper
Add Code

Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models

1 code implementation • 6 Feb 2024 • Kelvin J. L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua

The training samples for the PPO trainer are also the responses generated during the reflective process, which eliminates the need for human annotators.

Stock Prediction

Paper
Code

Data-efficient Fine-tuning for LLM-based Recommendation

no code implementations • 30 Jan 2024 • Xinyu Lin, Wenjie Wang, Yongqi Li, Shuo Yang, Fuli Feng, Yinwei Wei, Tat-Seng Chua

To pursue the two objectives, we propose a novel data pruning method based on two scores, i. e., influence score and effort score, to efficiently identify the influential samples.

Paper
Add Code

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

no code implementations • 28 Jan 2024 • Kangkang Lu, Yanhua Yu, Hao Fei, Xuan Li, Zixuan Yang, Zirui Guo, Meiyu Liang, Mengran Yin, Tat-Seng Chua

Moreover, we theoretically establish that the number of distinguishable eigenvalues plays a pivotal role in determining the expressive power of spectral graph neural networks.

Node Classification

Paper
Add Code

Towards 3D Molecule-Text Interpretation in Language Models

1 code implementation • 25 Jan 2024 • Sihang Li, Zhiyuan Liu, Yanchen Luo, Xiang Wang, Xiangnan He, Kenji Kawaguchi, Tat-Seng Chua, Qi Tian

Through 3D molecule-text alignment and 3D molecule-centric instruction tuning, 3D-MoLM establishes an integration of 3D molecular encoder and LM.

Instruction Following Language Modelling +3

Paper
Code

TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data

no code implementations • 24 Jan 2024 • Fengbin Zhu, Ziyang Liu, Fuli Feng, Chao Wang, Moxin Li, Tat-Seng Chua

In this work, we address question answering (QA) over a hybrid of tabular and textual data that are very common content on the Web (e. g. SEC filings), where discrete reasoning capabilities are often required.

Language Modelling Question Answering

Paper
Add Code

Instilling Multi-round Thinking to Text-guided Image Generation

no code implementations • 16 Jan 2024 • Lidong Zeng, Zhedong Zheng, Yinwei Wei, Tat-Seng Chua

This paper delves into the text-guided image editing task, focusing on modifying a reference image according to user-specified textual feedback to embody specific attributes.

Image Generation text-guided-generation +1

Paper
Add Code

GOODAT: Towards Test-time Graph Out-of-Distribution Detection

1 code implementation • 10 Jan 2024 • Luzhi Wang, Dongxiao He, He Zhang, Yixin Liu, Wenjie Wang, Shirui Pan, Di Jin, Tat-Seng Chua

To identify and reject OOD samples with GNNs, recent studies have explored graph OOD detection, often focusing on training a specific model or modifying the data on top of a well-trained GNN.

Out-of-Distribution Detection

Paper
Code

Temporally and Distributionally Robust Optimization for Cold-Start Recommendation

1 code implementation • 15 Dec 2023 • Xinyu Lin, Wenjie Wang, Jujia Zhao, Yongqi Li, Fuli Feng, Tat-Seng Chua

They learn a feature extractor on warm-start items to align feature representations with interactions, and then leverage the feature extractor to extract the feature representations of cold-start items for interaction prediction.

Collaborative Filtering

Paper
Code

Towards Goal-oriented Intelligent Tutoring Systems in Online Education

no code implementations • 3 Dec 2023 • Yang Deng, Zifeng Ren, An Zhang, Wenqiang Lei, Tat-Seng Chua

In this work, we investigate a new task, named Goal-oriented Intelligent Tutoring Systems (GITS), which aims to enable the student's mastery of a designated concept by strategically planning a customized sequence of exercises and assessment.

cognitive diagnosis Representation Learning

Paper
Add Code

SCTc-TE: A Comprehensive Formulation and Benchmark for Temporal Event Forecasting

1 code implementation • 2 Dec 2023 • Yunshan Ma, Chenchen Ye, Zijian Wu, Xiang Wang, Yixin Cao, Liang Pang, Tat-Seng Chua

Temporal complex event forecasting aims to predict the future events given the observed events from history.

Paper
Code

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

3 code implementations • 1 Dec 2023 • Tianyu Yu, Yuan YAO, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-Tao Zheng, Maosong Sun, Tat-Seng Chua

Multimodal Large Language Models (MLLMs) have recently demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction.

Hallucination

4,017

Paper
Code

MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation

1 code implementation • 28 Nov 2023 • Yunshan Ma, Yingzhi He, Xiang Wang, Yinwei Wei, Xiaoyu Du, Yuyangzi Fu, Tat-Seng Chua

It does, however, have two limitations: 1) the two-view formulation does not fully exploit all the heterogeneous relations among users, bundles and items; and 2) the "early contrast and late fusion" framework is less effective in capturing user preference and difficult to generalize to multiple views.

Contrastive Learning Representation Learning

Paper
Code

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching

no code implementations • 21 Nov 2023 • Meng Chu, Zhedong Zheng, Wei Ji, Tingyu Wang, Tat-Seng Chua

Navigating drones through natural language commands remains challenging due to the dearth of accessible multi-modal datasets and the stringent precision requirements for aligning visual and textual data.

Drone navigation Language Modelling +2

Paper
Add Code

Do LLMs Implicitly Exhibit User Discrimination in Recommendation? An Empirical Study

no code implementations • 13 Nov 2023 • Chen Xu, Wenjie Wang, Yuxin Li, Liang Pang, Jun Xu, Tat-Seng Chua

Recently, Large Language Models (LLMs) have enhanced user interaction, enabling seamless information retrieval and recommendations.

Information Retrieval Recommendation Systems +1

Paper
Add Code

NExT-Chat: An LMM for Chat, Detection and Segmentation

1 code implementation • 8 Nov 2023 • Ao Zhang, Yuan YAO, Wei Ji, Zhiyuan Liu, Tat-Seng Chua

The development of large language models (LLMs) has greatly advanced the field of multimodal understanding, leading to the emergence of large multimodal models (LMMs).

Referring Expression Referring Expression Segmentation +1

174

Paper
Code

Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents

1 code implementation • 1 Nov 2023 • Yang Deng, Wenxuan Zhang, Wai Lam, See-Kiong Ng, Tat-Seng Chua

Proactive dialogues serve as a practical yet challenging dialogue problem in the era of large language models (LLMs), where the dialogue policy planning is the key to improving the proactivity of LLMs.

Language Modelling Large Language Model

Paper
Code

Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction

1 code implementation • 28 Oct 2023 • Yunshan Ma, Xiaohao Liu, Yinwei Wei, Zhulin Tao, Xiang Wang, Tat-Seng Chua

Specifically, we use self-attention modules to combine the multimodal and multi-item features, and then leverage both item- and bundle-level contrastive learning to enhance the representation learning, thus to counter the modality missing, noise, and sparsity problems.

Contrastive Learning Representation Learning

Paper
Code

Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss

1 code implementation • NeurIPS 2023 • An Zhang, Leheng Sheng, Zhibo Cai, Xiang Wang, Tat-Seng Chua

To bridge the gap, we delve into the reasons underpinning the success of contrastive loss in CF, and propose a principled Adversarial InfoNCE loss (AdvInfoNCE), which is a variant of InfoNCE, specially tailored for CF methods.

Collaborative Filtering Contrastive Learning +3

Paper
Code

Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules

1 code implementation • NeurIPS 2023 • Zhiyuan Liu, Yaorui Shi, An Zhang, Enzhi Zhang, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua

Our results show that a subgraph-level tokenizer and a sufficiently expressive decoder with remask decoding have a large impact on the encoder's representation learning.

Decoder Representation Learning +1

Paper
Code

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter

1 code implementation • 19 Oct 2023 • Zhiyuan Liu, Sihang Li, Yanchen Luo, Hao Fei, Yixin Cao, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua

MolCA enables an LM (e. g., Galactica) to understand both text- and graph-based molecular contents via the cross-modal projector.

Ranked #4 on Molecule Captioning on ChEBI-20

Contrastive Learning IUPAC Name Prediction +5

Paper
Code

A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction

1 code implementation • 18 Oct 2023 • Ruihao Shui, Yixin Cao, Xiang Wang, Tat-Seng Chua

Large language models (LLMs) have demonstrated great potential for domain-specific applications, such as the law domain.

Information Retrieval Legal Reasoning +1

Paper
Code

On Generative Agents in Recommendation

1 code implementation • 16 Oct 2023 • An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-Seng Chua

Recommender systems are the cornerstone of today's information dissemination, yet a disconnect between offline metrics and online performance greatly hinders their development.

Collaborative Filtering Movie Recommendation +1

200

Paper
Code

Robust Collaborative Filtering to Popularity Distribution Shift

1 code implementation • 16 Oct 2023 • An Zhang, Wenchang Ma, Jingnan Zheng, Xiang Wang, Tat-Seng Chua

The popularity shortcut tricks are good for in-distribution (ID) performance but poorly generalized to out-of-distribution (OOD) data, i. e., when popularity distribution of test data shifts w. r. t.

Collaborative Filtering

Paper
Code

Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators

1 code implementation • 11 Oct 2023 • Liang Chen, Yang Deng, Yatao Bian, Zeyu Qin, Bingzhe Wu, Tat-Seng Chua, Kam-Fai Wong

Large language models (LLMs) outperform information retrieval techniques for downstream knowledge-intensive tasks when being prompted to generate world knowledge.

Information Retrieval Informativeness +4

Paper
Code

A Multi-facet Paradigm to Bridge Large Language Model and Recommendation

no code implementations • 10 Oct 2023 • Xinyu Lin, Wenjie Wang, Yongqi Li, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

To combat these issues, we propose a novel multi-facet paradigm, namely TransRec, to bridge the LLMs to recommendation.

Attribute Language Modelling +2

Paper
Add Code

Progressive Text-to-3D Generation for Automatic 3D Prototyping

1 code implementation • 26 Sep 2023 • Han Yi, Zhedong Zheng, Xiangyu Xu, Tat-Seng Chua

We aspire for our work to pave the way for automatic 3D prototyping via natural language descriptions.

3D Generation Text to 3D

Paper
Code

NExT-GPT: Any-to-Any Multimodal LLM

1 code implementation • 11 Sep 2023 • Shengqiong Wu, Hao Fei, Leigang Qu, Wei Ji, Tat-Seng Chua

While recently Multimodal Large Language Models (MM-LLMs) have made exciting strides, they mostly fall prey to the limitation of only input-side multimodal understanding, without the ability to produce content in multiple modalities.

2,945

Paper
Code

Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs

no code implementations • 26 Aug 2023 • Hao Fei, Shengqiong Wu, Wei Ji, Hanwang Zhang, Tat-Seng Chua

In this work, we investigate strengthening the awareness of video dynamics for DMs, for high-quality T2V generation.

In-Context Learning Video Generation

Paper
Add Code

I3: Intent-Introspective Retrieval Conditioned on Instructions

no code implementations • 19 Aug 2023 • Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang

Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents.

Retrieval Text-to-Image Generation

Paper
Add Code

Diffusion Variational Autoencoder for Tackling Stochasticity in Multi-Step Regression Stock Price Prediction

1 code implementation • 18 Aug 2023 • Kelvin J. L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua

The hierarchical VAE allows us to learn the complex and low-level latent variables for stock prediction, while the diffusion probabilistic model trains the predictor to handle stock price stochasticity by progressively adding random noise to the stock data.

regression Stock Prediction +1

Paper
Code

Context-aware Event Forecasting via Graph Disentanglement

1 code implementation • 12 Aug 2023 • Yunshan Ma, Chenchen Ye, Zijian Wu, Xiang Wang, Yixin Cao, Tat-Seng Chua

The task of event forecasting aims to model the relational and temporal patterns based on historical events and makes forecasting to what will happen in the future.

Disentanglement Link Prediction

Paper
Code

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

no code implementations • 9 Aug 2023 • Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, Tat-Seng Chua

Afterward, we propose a fine-grained object-interaction diffusion method to synthesize high-faithfulness images conditioned on the prompt and the automatically generated layout.

In-Context Learning Text-to-Image Generation

Paper
Add Code

Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling

no code implementations • 9 Aug 2023 • Yu Zhao, Hao Fei, Yixin Cao, Bobo Li, Meishan Zhang, Jianguo Wei, Min Zhang, Tat-Seng Chua

A scene-event mapping mechanism is first designed to bridge the gap between the underlying scene structure and the high-level event semantic structure, resulting in an overall hierarchical scene-event (termed ICE) graph structure.

Semantic Role Labeling

Paper
Add Code

Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition

no code implementations • 8 Aug 2023 • Bobo Li, Hao Fei, Lizi Liao, Yu Zhao, Chong Teng, Tat-Seng Chua, Donghong Ji, Fei Li

On the other hand, during the feature fusion stage, we propose a Contribution-aware Fusion Mechanism (CFM) and a Context Refusion Mechanism (CRM) for multimodal and context integration, respectively.

Ranked #5 on Emotion Recognition in Conversation on IEMOCAP

Contrastive Learning Disentanglement +2

Paper
Add Code

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

1 code implementation • 8 Aug 2023 • Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang

This shortcoming results in MLLMs' underperformance in comprehending demonstrative instructions consisting of multiple, interleaved, and multimodal instructions that demonstrate the required context to complete a task.

Caption Generation Image Captioning +1

326

Paper
Code

Redundancy-aware Transformer for Video Question Answering

no code implementations • 7 Aug 2023 • Yicong Li, Xun Yang, An Zhang, Chun Feng, Xiang Wang, Tat-Seng Chua

This paper identifies two kinds of redundancy in the current VideoQA paradigm.

Question Answering Video Question Answering

Paper
Add Code

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning

2 code implementations • 3 Aug 2023 • Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, Junxian He

More recently, with the rapid development of language models (LMs), researchers have focused on leveraging LMs to facilitate the learning of TGs, either by jointly training them in a computationally intensive framework (merging the two stages), or designing complex self-supervised training tasks for feature extraction (enhancing the first stage).

Ranked #1 on Node Property Prediction on ogbn-arxiv

Feature Engineering Graph Learning +3

Paper
Code

XNLP: An Interactive Demonstration System for Universal Structured NLP

no code implementations • 3 Aug 2023 • Hao Fei, Meishan Zhang, Min Zhang, Tat-Seng Chua

Structured Natural Language Processing (XNLP) is an important subset of NLP that entails understanding the underlying semantic or syntactic structure of texts, which serves as a foundational component for many downstream applications.

Paper
Add Code

Discovering Spatio-Temporal Rationales for Video Question Answering

1 code implementation • ICCV 2023 • Yicong Li, Junbin Xiao, Chun Feng, Xiang Wang, Tat-Seng Chua

We then conduct extensive studies to verify the importance of STR as well as the proposed answer interaction mechanism.

Question Answering Video Question Answering

Paper
Code

LightGT: A Light Graph Transformer for Multimedia Recommendation

1 code implementation • SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023 • Yinwei Wei, Wenqi Liu, Fan Liu, Xiang Wang, Liqiang Nie, Tat-Seng Chua

Considering its challenges in effectiveness and efficiency, we propose a novel Transformer-based recommendation model, termed as Light Graph Transformer model (LightGT).

Ranked #1 on Multi-Media Recommendation on Kwai (Recall@10 metric)

Collaborative Filtering Microvideo Recommendation +4

Paper
Code

Revisiting Conversation Discourse for Dialogue Disentanglement

no code implementations • 6 Jun 2023 • Bobo Li, Hao Fei, Fei Li, Shengqiong Wu, Lizi Liao, Yinwei Wei, Tat-Seng Chua, Donghong Ji

Conversation utterances are essentially organized and described by the underlying discourse, and thus dialogue disentanglement requires the full understanding and harnessing of the intrinsic discourse attribute.

Attribute Disentanglement

Paper
Add Code

Computational Technologies for Fashion Recommendation: A Survey

no code implementations • 6 Jun 2023 • Yujuan Ding, Zhihui Lai, P. Y. Mok, Tat-Seng Chua

Fashion recommendation is a key research field in computational fashion research and has attracted considerable interest in the computer vision, multimedia, and information retrieval communities in recent years.

Information Retrieval Product Recommendation

Paper
Add Code

Discovering Dynamic Causal Space for DAG Structure Learning

1 code implementation • 5 Jun 2023 • Fangfu Liu, Wenchang Ma, An Zhang, Xiang Wang, Yueqi Duan, Tat-Seng Chua

Discovering causal structure from purely observational data (i. e., causal discovery), aiming to identify causal relationships among variables, is a fundamental task in machine learning.

Causal Discovery Combinatorial Optimization

Paper
Code

LLMDet: A Third Party Large Language Models Generated Text Detection Tool

1 code implementation • 24 May 2023 • Kangxi Wu, Liang Pang, HuaWei Shen, Xueqi Cheng, Tat-Seng Chua

By jointly analyzing the proxy perplexities of LLMs, we can determine the source of the generated text.

Language Modelling Large Language Model +1

Paper
Code

Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration

1 code implementation • 23 May 2023 • Yang Deng, Lizi Liao, Liang Chen, Hongru Wang, Wenqiang Lei, Tat-Seng Chua

Conversational systems based on Large Language Models (LLMs), such as ChatGPT, show exceptional proficiency in context understanding and response generation.

Descriptive Response Generation

Paper
Code

Robust Prompt Optimization for Large Language Models Against Distribution Shifts

no code implementations • 23 May 2023 • Moxin Li, Wenjie Wang, Fuli Feng, Yixin Cao, Jizhi Zhang, Tat-Seng Chua

In this light, we propose a new problem of robust prompt optimization for LLMs against distribution shifts, which requires the prompt optimized over the labeled source group can simultaneously generalize to an unlabeled target group.

Language Modelling Large Language Model

Paper
Add Code

MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space

1 code implementation • 22 May 2023 • Hanxing Ding, Liang Pang, Zihao Wei, HuaWei Shen, Xueqi Cheng, Tat-Seng Chua

Multi-aspect controllable text generation aims to generate fluent sentences that possess multiple desired attributes simultaneously.

Attribute Decoder +1

Paper
Code

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions

no code implementations • 21 May 2023 • Bosheng Qin, Juncheng Li, Siliang Tang, Tat-Seng Chua, Yueting Zhuang

We introduce InstructVid2Vid, an end-to-end diffusion-based methodology for video editing guided by human language instructions.

Attribute Image Generation +2

Paper
Add Code

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

1 code implementation • 20 May 2023 • Hao Fei, Qian Liu, Meishan Zhang, Min Zhang, Tat-Seng Chua

In this work, we investigate a more realistic unsupervised multimodal machine translation (UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text image pairs, and tested with only source-text inputs.

Hallucination Multimodal Machine Translation +1

Paper
Code

Constructing Code-mixed Universal Dependency Forest for Unbiased Cross-lingual Relation Extraction

no code implementations • 20 May 2023 • Hao Fei, Meishan Zhang, Min Zhang, Tat-Seng Chua

Latest efforts on cross-lingual relation extraction (XRE) aggressively leverage the language-consistent structural features from the universal dependency (UD) resource, while they may largely suffer from biased transfer (e. g., either target-biased or source-biased) due to the inevitable linguistic disparity between languages.

Relation Relation Extraction +1

Paper
Add Code

Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment

no code implementations • 20 May 2023 • Shengqiong Wu, Hao Fei, Wei Ji, Tat-Seng Chua

Unpaired cross-lingual image captioning has long suffered from irrelevancy and disfluency issues, due to the inconsistencies of the semantic scene and syntax attributes during transfer.

Image Captioning Translation

Paper
Add Code

Generating Visual Spatial Description via Holistic 3D Scene Understanding

1 code implementation • 19 May 2023 • Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, Tat-Seng Chua

With an external 3D scene extractor, we obtain the 3D objects and scene features for input images, based on which we construct a target object-centered 3D spatial scene graph (Go3D-S2G), such that we model the spatial semantics of target objects within the holistic 3D scenes.

Scene Understanding Text Generation

Paper
Code

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling

1 code implementation • 19 May 2023 • Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing, Tat-Seng Chua

First, we represent the fine-grained semantic structures of the input image and text with the visual and textual scene graphs, which are further fused into a unified cross-modal graph (CMG).

Denoising Relation +1

Paper
Code

Reasoning Implicit Sentiment with Chain-of-Thought Prompting

2 code implementations • 18 May 2023 • Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-Seng Chua

While sentiment analysis systems try to determine the sentiment polarities of given targets based on the key opinion expressions in input texts, in implicit sentiment analysis (ISA) the opinion cues come in an implicit and obscure manner.

Common Sense Reasoning Sentiment Analysis

Paper
Code

Dual Semantic Knowledge Composed Multimodal Dialog Systems

no code implementations • 17 May 2023 • Xiaolin Chen, Xuemeng Song, Yinwei Wei, Liqiang Nie, Tat-Seng Chua

Thereafter, considering that the attribute knowledge and relation knowledge can benefit the responding to different levels of questions, we design a multi-level knowledge composition module in MDS-S2 to obtain the latent composed response representation.

Attribute Relation +1

Paper
Add Code

A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects

no code implementations • 4 May 2023 • Yang Deng, Wenqiang Lei, Wai Lam, Tat-Seng Chua

Proactive dialogue systems, related to a wide range of real-world conversational applications, equip the conversational agent with the capability of leading the conversation direction towards achieving pre-defined targets or fulfilling certain goals from the system side.

Paper
Add Code

Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs

1 code implementation • 3 May 2023 • Fengbin Zhu, Chao Wang, Fuli Feng, Zifeng Ren, Moxin Li, Tat-Seng Chua

Discrete reasoning over table-text documents (e. g., financial reports) gains increasing attention in recent two years.

Paper
Code

VPGTrans: Transfer Visual Prompt Generator across LLMs

1 code implementation • NeurIPS 2023 • Ao Zhang, Hao Fei, Yuan YAO, Wei Ji, Li Li, Zhiyuan Liu, Tat-Seng Chua

While developing a new multimodal LLM (MLLM) by pre-training on tremendous image-text pairs from scratch can be exceedingly resource-consuming, connecting an existing LLM with a comparatively lightweight visual prompt generator (VPG) becomes a feasible paradigm.

Transfer Learning

264

Paper
Code

Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks

1 code implementation • 28 Apr 2023 • Shicheng Xu, Liang Pang, HuaWei Shen, Xueqi Cheng, Tat-Seng Chua

This paper proposes a novel framework named \textbf{Search-in-the-Chain} (SearChain) for the interaction between LLM and IR to solve the challenges.

Fact Checking Information Retrieval +7

Paper
Code

Learnable Pillar-based Re-ranking for Image-Text Retrieval

1 code implementation • 25 Apr 2023 • Leigang Qu, Meng Liu, Wenjie Wang, Zhedong Zheng, Liqiang Nie, Tat-Seng Chua

Image-text retrieval aims to bridge the modality gap and retrieve cross-modal content based on semantic similarities.

Re-Ranking Retrieval +1

Paper
Code

On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training

no code implementations • 19 Apr 2023 • Hao Fei, Tat-Seng Chua, Chenliang Li, Donghong Ji, Meishan Zhang, Yafeng Ren

In this study, we propose to enhance the ABSA robustness by systematically rethinking the bottlenecks from all possible angles, including model, data, and training.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Add Code

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model

1 code implementation • 13 Apr 2023 • Hao Fei, Shengqiong Wu, Jingye Li, Bobo Li, Fei Li, Libo Qin, Meishan Zhang, Min Zhang, Tat-Seng Chua

Universally modeling all typical information extraction tasks (UIE) with one generative language model (GLM) has revealed great potential by the latest study, where various IE predictions are unified into a linearized hierarchical expression under a GLM.

Language Modelling UIE

Paper
Code

Diffusion Recommender Model

1 code implementation • 11 Apr 2023 • Wenjie Wang, Yiyan Xu, Fuli Feng, Xinyu Lin, Xiangnan He, Tat-Seng Chua

In light of the impressive advantages of Diffusion Models (DMs) over traditional generative models in image synthesis, we propose a novel Diffusion Recommender Model (named DiffRec) to learn the generative process in a denoising manner.

Denoising Image Generation +1

155

Paper
Code

Generative Recommendation: Towards Next-generation Recommender Paradigm

1 code implementation • 7 Apr 2023 • Wenjie Wang, Xinyu Lin, Fuli Feng, Xiangnan He, Tat-Seng Chua

However, such a retrieval-based recommender paradigm faces two limitations: 1) the human-generated items in the corpus might fail to satisfy the users' diverse information needs, and 2) users usually adjust the recommendations via inefficient passive feedback, e. g., clicks.

Recommendation Systems Retrieval +1

Paper
Code

Causal Disentangled Recommendation Against User Preference Shifts

1 code implementation • 28 Mar 2023 • Wenjie Wang, Xinyu Lin, Liuhui Wang, Fuli Feng, Yunshan Ma, Tat-Seng Chua

Inspired by the causal graph, our key considerations to handle preference shifts lie in modeling the interaction generation procedure by: 1) capturing the preference shifts across environments for accurate preference prediction, and 2) disentangling the sparse influence from user preference to interactions for accurate effect estimation of preference.

Recommendation Systems

Paper
Code

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models

no code implementations • ICCV 2023 • Juncheng Li, Minghe Gao, Longhui Wei, Siliang Tang, Wenqiao Zhang, Mengze Li, Wei Ji, Qi Tian, Tat-Seng Chua, Yueting Zhuang

Prompt tuning, a recently emerging paradigm, enables the powerful vision-language pre-training models to adapt to downstream tasks in a parameter -- and data -- efficient way, by learning the ``soft prompts'' to condition frozen pre-training models.

Domain Generalization Few-Shot Learning +1

Paper
Add Code

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

1 code implementation • 6 Mar 2023 • An Zhang, Fangfu Liu, Wenchang Ma, Zhibo Cai, Xiang Wang, Tat-Seng Chua

Despite great success in low-dimensional linear systems, it has been observed that these approaches overly exploit easier-to-fit samples, thus inevitably learning spurious edges.

Bilevel Optimization Causal Discovery

Paper
Code

Contrastive Video Question Answering via Video Graph Transformer

1 code implementation • 27 Feb 2023 • Junbin Xiao, Pan Zhou, Angela Yao, Yicong Li, Richang Hong, Shuicheng Yan, Tat-Seng Chua

CoVGT's uniqueness and superiority are three-fold: 1) It proposes a dynamic graph transformer module which encodes video by explicitly capturing the visual objects, their relations and dynamics, for complex spatio-temporal reasoning.

Ranked #14 on Video Question Answering on NExT-QA (using extra training data)

Contrastive Learning Question Answering +1

Paper
Code

Invariant Collaborative Filtering to Popularity Distribution Shift

1 code implementation • 10 Feb 2023 • An Zhang, Jingnan Zheng, Xiang Wang, Yancheng Yuan, Tat-Seng Chua

Collaborative Filtering (CF) models, despite their great success, suffer from severe performance drops due to popularity distribution shifts, where these changes are ubiquitous and inevitable in real-world scenarios.

Collaborative Filtering Representation Learning

Paper
Code

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding

no code implementations • 22 Jan 2023 • Juncheng Li, Siliang Tang, Linchao Zhu, Wenqiao Zhang, Yi Yang, Tat-Seng Chua, Fei Wu, Yueting Zhuang

To systematically benchmark the compositional generalizability of temporal grounding models, we introduce a new Compositional Temporal Grounding task and construct two new dataset splits, i. e., Charades-CG and ActivityNet-CG.

Semantic correspondence Sentence

Paper
Add Code

Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning

1 code implementation • CVPR 2023 • Wei Ji, Renjie Liang, Zhedong Zheng, Wenqiao Zhang, Shengyu Zhang, Juncheng Li, Mengze Li, Tat-Seng Chua

Moreover, we treat the uncertainty score of frames in a video as a whole, and estimate the difficulty of each video, which can further relieve the burden of video selection.

Active Learning Moment Retrieval +1

Paper
Code

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding

no code implementations • 26 Dec 2022 • Wei Ji, Long Chen, Yinwei Wei, Yiming Wu, Tat-Seng Chua

In this work, we propose a novel multi-resolution temporal video sentence grounding network: MRTNet, which consists of a multi-modal feature encoder, a Multi-Resolution Temporal (MRT) module, and a predictor module.

Decoder Descriptive +1

Paper
Add Code

Causal Inference for Knowledge Graph based Recommendation

1 code implementation • 20 Dec 2022 • Yinwei Wei, Xiang Wang, Liqiang Nie, Shaoyu Li, Dingxian Wang, Tat-Seng Chua

Knowledge Graph (KG), as a side-information, tends to be utilized to supplement the collaborative filtering (CF) based recommendation model.

Collaborative Filtering counterfactual +1

Paper
Code

Cognitive Accident Prediction in Driving Scenes: A Multimodality Benchmark

1 code implementation • 19 Dec 2022 • Jianwu Fang, Lei-Lei Li, Kuan Yang, Zhedong Zheng, Jianru Xue, Tat-Seng Chua

In particular, the text description provides a dense semantic description guidance for the primary context of the traffic scene, while the driver attention provides a traction to focus on the critical region closely correlating with safe driving.

Decision Making

Paper
Code

Visually Grounded Commonsense Knowledge Acquisition

1 code implementation • 22 Nov 2022 • Yuan YAO, Tianyu Yu, Ao Zhang, Mengdi Li, Ruobing Xie, Cornelius Weber, Zhiyuan Liu, Hai-Tao Zheng, Stefan Wermter, Tat-Seng Chua, Maosong Sun

In this work, we present CLEVER, which formulates CKE as a distantly supervised multi-instance learning problem, where models learn to summarize commonsense relations from a bag of images about an entity pair without any human annotation on image instances.

Language Modelling

Paper
Code

PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation

1 code implementation • 14 Nov 2022 • Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua

In an attempt to fill this gap, we propose a unified pixel- and patch-wise self-supervised learning framework, called PiPa, for domain adaptive semantic segmentation that facilitates intra-image pixel-wise correlations and patch-wise semantic consistency against different contexts.

Ranked #1 on Semantic Segmentation on SYNTHIA-to-Cityscapes

Self-Supervised Learning Semantic Segmentation +2

Paper
Code

Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization

1 code implementation • 14 Nov 2022 • Yiyang Chen, Zhedong Zheng, Wei Ji, Leigang Qu, Tat-Seng Chua

The key idea underpinning the proposed method is to integrate fine- and coarse-grained retrieval as matching data points with small and large fluctuations, respectively.

Ranked #3 on Image Retrieval with Multi-Modal Query on Fashion200k

Composed Image Retrieval (CoIR) Image Retrieval with Multi-Modal Query +1

Paper
Code

DiaASQ : A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis

1 code implementation • 10 Nov 2022 • Bobo Li, Hao Fei, Fei Li, Yuhan Wu, Jinsong Zhang, Shengqiong Wu, Jingye Li, Yijiang Liu, Lizi Liao, Tat-Seng Chua, Donghong Ji

The rapid development of aspect-based sentiment analysis (ABSA) within recent decades shows great potential for real-world society.

Ranked #1 on Conversational Sentiment Quadruple Extraction on DiaASQ (ZH)

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Code

Behavioral Intention Prediction in Driving Scenes: A Survey

no code implementations • 1 Nov 2022 • Jianwu Fang, Fan Wang, Jianru Xue, Tat-Seng Chua

Behavioral Intention Prediction (BIP) simulates such a human consideration process and fulfills the early prediction of specific behaviors.

Trajectory Prediction

Paper
Add Code

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering

1 code implementation • 20 Oct 2022 • An Zhang, Wenchang Ma, Xiang Wang, Tat-Seng Chua

Collaborative filtering (CF) models easily suffer from popularity bias, which makes recommendation deviate from users' actual preferences.

Collaborative Filtering

Paper
Code

ConReader: Exploring Implicit Relations in Contracts for Contract Clause Extraction

1 code implementation • 17 Oct 2022 • Weiwen Xu, Yang Deng, Wenqiang Lei, Wenlong Zhao, Tat-Seng Chua, Wai Lam

We study automatic Contract Clause Extraction (CCE) by modeling implicit relations in legal contracts.

Implicit Relations Relation

Paper
Code

PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance

1 code implementation • 17 Oct 2022 • Yang Deng, Wenqiang Lei, Wenxuan Zhang, Wai Lam, Tat-Seng Chua

To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC.

Code Generation Conversational Question Answering +3

Paper
Code

Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation

1 code implementation • 17 Aug 2022 • Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-Seng Chua, Fei Wu

Specifically, Re4 encapsulates three backward flows, i. e., 1) Re-contrast, which drives each interest embedding to be distinct from other interests using contrastive learning; 2) Re-attend, which ensures the interest-item correlation estimation in the forward flow to be consistent with the criterion used in final recommendation; and 3) Re-construct, which ensures that each interest embedding can semantically reflect the information of representative items that relate to the corresponding interest.

Contrastive Learning Recommendation Systems

Paper
Code

CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation

no code implementations • 17 Aug 2022 • Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-Seng Chua, Fei Wu

Micro-video recommender systems suffer from the ubiquitous noises in users' behaviors, which might render the learned user representation indiscriminating, and lead to trivial recommendations (e. g., popular items) or even weird ones that are far beyond users' interests.

Contrastive Learning Recommendation Systems

Paper
Add Code

Equivariant and Invariant Grounding for Video Question Answering

1 code implementation • 26 Jul 2022 • Yicong Li, Xiang Wang, Junbin Xiao, Tat-Seng Chua

Specifically, the equivariant grounding encourages the answering to be sensitive to the semantic changes in the causal scene and question; in contrast, the invariant grounding enforces the answering to be insensitive to the changes in the environment scene.

Question Answering Video Question Answering

Paper
Code

Towards Complex Document Understanding By Discrete Reasoning

no code implementations • 25 Jul 2022 • Fengbin Zhu, Wenqiang Lei, Fuli Feng, Chao Wang, Haozhou Zhang, Tat-Seng Chua

Document Visual Question Answering (VQA) aims to understand visually-rich documents to answer questions in natural language, which is an emerging research topic for both Natural Language Processing and Computer Vision.

document understanding Question Answering +1

Paper
Add Code

Video Graph Transformer for Video Question Answering

1 code implementation • 12 Jul 2022 • Junbin Xiao, Pan Zhou, Tat-Seng Chua, Shuicheng Yan

VGT's uniqueness are two-fold: 1) it designs a dynamic graph transformer module which encodes video by explicitly capturing the visual objects, their relations, and dynamics for complex spatio-temporal reasoning; and 2) it exploits disentangled video and text Transformers for relevance comparison between the video and text to perform QA, instead of entangled cross-modal Transformer for answer classification.

Ranked #4 on Video Question Answering on IntentQA

Question Answering Relation +2

Paper
Code

Structured and Natural Responses Co-generation for Conversational Search

1 code implementation • ACM SIGIR Conference on Research and Development in Information Retrieval 2022 • Chenchen Ye, Lizi Liao, Fuli Feng, Wei Ji, Tat-Seng Chua

Existing approaches either 1) predict structured dialog acts first and then generate natural response; or 2) map conversation context to natural responses directly in an end-to-end manner.

Conversational Search

Paper
Code

Intelligent Request Strategy Design in Recommender System

no code implementations • 23 Jun 2022 • Xufeng Qian, Yue Xu, Fuyu Lv, Shengyu Zhang, Ziwen Jiang, Qingwen Liu, Xiaoyi Zeng, Tat-Seng Chua, Fei Wu

RSs typically put a large number of items into one page to reduce excessive resource consumption from numerous paging requests, which, however, would diminish the RSs' ability to timely renew the recommendations according to users' real-time interest and lead to a poor user experience.

Causal Inference counterfactual +1

Paper
Add Code

Let Invariant Rationale Discovery Inspire Graph Contrastive Learning

1 code implementation • 16 Jun 2022 • Sihang Li, Xiang Wang, An Zhang, Yingxin Wu, Xiangnan He, Tat-Seng Chua

Specifically, without supervision signals, RGCL uses a rationale generator to reveal salient features about graph instance-discrimination as the rationale, and then creates rationale-aware views for contrastive learning.

Contrastive Learning

Paper
Code

Invariant Grounding for Video Question Answering

1 code implementation • CVPR 2022 • Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-Seng Chua

At its core is understanding the alignments between visual scenes in video and linguistic semantics in question to yield the answer.

Question Answering Video Question Answering

Paper
Code

CrossCBR: Cross-view Contrastive Learning for Bundle Recommendation

1 code implementation • 1 Jun 2022 • Yunshan Ma, Yingzhi He, An Zhang, Xiang Wang, Tat-Seng Chua

Recent methods usually take advantage of both user-bundle and user-item interactions information to obtain informative representations for users and bundles, corresponding to bundle view and item view, respectively.

Contrastive Learning Graph Learning

Paper
Code

Differentiable Invariant Causal Discovery

no code implementations • 31 May 2022 • Yu Wang, An Zhang, Xiang Wang, Yancheng Yuan, Xiangnan He, Tat-Seng Chua

This paper proposes Differentiable Invariant Causal Discovery (DICD), utilizing the multi-environment information based on a differentiable framework to avoid learning spurious edges and wrong causal directions.

Causal Discovery

Paper
Add Code

PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models

1 code implementation • 23 May 2022 • Yuan YAO, Qianyu Chen, Ao Zhang, Wei Ji, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun

We show that PEVL enables state-of-the-art performance of detector-free VLP models on position-sensitive tasks such as referring expression comprehension and phrase grounding, and also improves the performance on position-insensitive tasks with grounded inputs.

Ranked #1 on Visual Commonsense Reasoning on VCR (Q-AR) test

Language Modelling Object +7

Paper
Code

User-controllable Recommendation Against Filter Bubbles

1 code implementation • 29 Apr 2022 • Wenjie Wang, Fuli Feng, Liqiang Nie, Tat-Seng Chua

both accuracy and diversity.

Blocking counterfactual +3

Paper
Code

3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective

1 code implementation • 27 Apr 2022 • Zhedong Zheng, Jiayin Zhu, Wei Ji, Yi Yang, Tat-Seng Chua

This research aims to study a self-supervised 3D clothing reconstruction method, which recovers the geometry shape and texture of human clothing from a single image.

Ranked #1 on Single-View 3D Reconstruction on CUB-200-2011

3D Reconstruction Person Re-Identification +2

Paper
Code

Reinforced Causal Explainer for Graph Neural Networks

1 code implementation • 23 Apr 2022 • Xiang Wang, Yingxin Wu, An Zhang, Fuli Feng, Xiangnan He, Tat-Seng Chua

Such reward accounts for the dependency of the newly-added edge and the previously-added edges, thus reflecting whether they collaborate together and form a coalition to pursue better explanations.

Graph Classification

Paper
Code

Multiple-environment Self-adaptive Network for Aerial-view Geo-localization

2 code implementations • 18 Apr 2022 • Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Chenggang Yan, Yi Yang, Tat-Seng Chua

This task is mostly regarded as an image retrieval problem.

Image Retrieval Retrieval

437

Paper
Code

A Unified Multi-task Learning Framework for Multi-goal Conversational Recommender Systems

1 code implementation • 14 Apr 2022 • Yang Deng, Wenxuan Zhang, Weiwen Xu, Wenqiang Lei, Tat-Seng Chua, Wai Lam

In this work, we propose a novel Unified MultI-goal conversational recommeNDer system, namely UniMIND.

Multi-Task Learning Recommendation Systems +1

Paper
Code

Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy

no code implementations • 7 Apr 2022 • Wenqiang Lei, Yao Zhang, Feifan Song, Hongru Liang, Jiaxin Mao, Jiancheng Lv, Zhenglu Yang, Tat-Seng Chua

To this end, we contribute to advance the study of the proactive dialogue policy to a more natural and challenging setting, i. e., interacting dynamically with users.

Paper
Add Code

Fine-Grained Scene Graph Generation with Data Transfer

2 code implementations • 22 Mar 2022 • Ao Zhang, Yuan YAO, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua

Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images.

Ranked #1 on Predicate Classification on Visual Genome

Graph Generation Predicate Classification +3

Paper
Code

Video Question Answering: Datasets, Algorithms and Challenges

1 code implementation • 2 Mar 2022 • Yaoyao Zhong, Junbin Xiao, Wei Ji, Yicong Li, Weihong Deng, Tat-Seng Chua

Video Question Answering (VideoQA) aims to answer natural language questions according to the given videos.

Question Answering Video Question Answering

Paper
Code

KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

3 code implementations • 22 Feb 2022 • Chongming Gao, Shijun Li, Wenqiang Lei, Jiawei Chen, Biao Li, Peng Jiang, Xiangnan He, Jiaxin Mao, Tat-Seng Chua

The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive.

Recommendation Systems User Simulation

119

Paper
Code

Discovering Invariant Rationales for Graph Neural Networks

1 code implementation • ICLR 2022 • Ying-Xin Wu, Xiang Wang, An Zhang, Xiangnan He, Tat-Seng Chua

Intrinsic interpretability of graph neural networks (GNNs) is to find a small subset of the input graph's features -- rationale -- which guides the model prediction.

Graph Classification

113

Paper
Code

Deconfounding to Explanation Evaluation in Graph Neural Networks

no code implementations • 21 Jan 2022 • Ying-Xin Wu, Xiang Wang, An Zhang, Xia Hu, Fuli Feng, Xiangnan He, Tat-Seng Chua

In this work, we propose Deconfounded Subgraph Evaluation (DSE) which assesses the causal effect of an explanatory subgraph on the model prediction.

Paper
Add Code

A Dual Prompt Learning Framework for Few-Shot Dialogue State Tracking

no code implementations • 15 Jan 2022 • Yuting Yang, Wenqiang Lei, Pei Huang, Juan Cao, Jintao Li, Tat-Seng Chua

In this paper, we focus on how to utilize the language understanding and generation ability of pre-trained language models for DST.

Dialogue State Tracking Language Modelling

Paper
Add Code

Training Free Graph Neural Networks for Graph Matching

1 code implementation • 14 Jan 2022 • Zhiyuan Liu, Yixin Cao, Fuli Feng, Xiang Wang, Jie Tang, Kenji Kawaguchi, Tat-Seng Chua

We present a framework of Training Free Graph Matching (TFGM) to boost the performance of Graph Neural Networks (GNNs) based graph matching, providing a fast promising solution without training (training-free).

Entity Alignment Graph Matching +1

Paper
Code

Causal Attention for Interpretable and Generalizable Graph Classification

1 code implementation • 30 Dec 2021 • Yongduo Sui, Xiang Wang, Jiancan Wu, Min Lin, Xiangnan He, Tat-Seng Chua

To endow the classifier with better interpretation and generalization, we propose the Causal Attention Learning (CAL) strategy, which discovers the causal patterns and mitigates the confounding effect of shortcuts.

Graph Attention Graph Classification

Paper
Code

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering

1 code implementation • 12 Dec 2021 • Junbin Xiao, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji, Tat-Seng Chua

To align with the multi-granular essence of linguistic concepts in language queries, we propose to model video as a conditional graph hierarchy which weaves together visual facts of different granularity in a level-wise manner, with the guidance of corresponding textual cues.

Ranked #5 on Video Question Answering on IntentQA

Question Answering Video Question Answering +1

Paper
Code

Rethinking the Two-Stage Framework for Grounded Situation Recognition

1 code implementation • 10 Dec 2021 • Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Since each verb is associated with a specific set of semantic roles, all existing GSR methods resort to a two-stage framework: predicting the verb in the first stage and detecting the semantic roles in the second stage.

Ranked #3 on Situation Recognition on imSitu

Grounded Situation Recognition Object Recognition +1

Paper
Code

Learning Robust Recommender from Noisy Implicit Feedback

1 code implementation • 2 Dec 2021 • Wenjie Wang, Fuli Feng, Xiangnan He, Liqiang Nie, Tat-Seng Chua

Inspired by this observation, we propose a new training strategy named Adaptive Denoising Training (ADT), which adaptively prunes the noisy interactions by two paradigms (i. e., Truncated Loss and Reweighted Loss).

Denoising Recommendation Systems

Paper
Code

Towards Multi-Grained Explainability for Graph Neural Networks

1 code implementation • NeurIPS 2021 • Xiang Wang, Yingxin Wu, An Zhang, Xiangnan He, Tat-Seng Chua

A performant paradigm towards multi-grained explainability is until-now lacking and thus a focus of our work.

Paper
Code

Inductive Lottery Ticket Learning for Graph Neural Networks

no code implementations • 29 Sep 2021 • Yongduo Sui, Xiang Wang, Tianlong Chen, Xiangnan He, Tat-Seng Chua

In this work, we propose a simple and effective learning paradigm, Inductive Co-Pruning of GNNs (ICPG), to endow graph lottery tickets with inductive pruning capacity.

Graph Classification Node Classification +1

Paper
Add Code

Decoupling Strategy and Surface Realization for Task-oriented Dialogues

no code implementations • 29 Sep 2021 • Chenchen Ye, Lizi Liao, Fuli Feng, Wei Ji, Tat-Seng Chua

The core is to construct a latent content space for strategy optimization and disentangle the surface style from it.

Reinforcement Learning (RL) Style Transfer +1

Paper
Add Code

Why Do We Click: Visual Impression-aware News Recommendation

1 code implementation • 26 Sep 2021 • Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu

In this work, inspired by the fact that users make their click decisions mostly based on the visual impression they perceive when browsing news, we propose to capture such visual impression information with visual-semantic modeling for news recommendation.

Decision Making News Recommendation

Paper
Code

CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models

1 code implementation • 24 Sep 2021 • Yuan YAO, Ao Zhang, Zhengyan Zhang, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun

Pre-Trained Vision-Language Models (VL-PTMs) have shown promising capabilities in grounding natural language in image data, facilitating a broad variety of cross-modal tasks.

Visual Grounding

Paper
Code

CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation

no code implementations • 11 Sep 2021 • Shengyu Zhang, Dong Yao, Zhou Zhao, Tat-Seng Chua, Fei Wu

In this paper, we propose to learn accurate and robust user representations, which are required to be less sensitive to (attack on) noisy behaviors and trust more on the indispensable ones, by modeling counterfactual data distribution.

counterfactual Representation Learning +1

Paper
Add Code

How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction

1 code implementation • ACL 2021 • Zikun Hu, Yixin Cao, Lifu Huang, Tat-Seng Chua

In this paper, we contribute a dataset and propose a paradigm to quantitatively evaluate the effect of attention and KG on bag-level relation extraction (RE).

Relation Relation Extraction

Paper
Code

How Knowledge Graph and Attention Help? A Quantitative Analysis into Bag-level Relation Extraction

1 code implementation • 26 Jul 2021 • Zikun Hu, Yixin Cao, Lifu Huang, Tat-Seng Chua

In this paper, we contribute a dataset and propose a paradigm to quantitatively evaluate the effect of attention and KG on bag-level relation extraction (RE).

Relation Relation Extraction

Paper
Code

Contrastive Learning for Cold-Start Recommendation

1 code implementation • 12 Jul 2021 • Yinwei Wei, Xiang Wang, Qi Li, Liqiang Nie, Yan Li, Xuanping Li, Tat-Seng Chua

It aims to maximize the mutual dependencies between item content and collaborative signals.

Contrastive Learning Recommendation Systems +1

Paper
Code

MConv: An Environment for Multimodal Conversational Search across Multiple Domains

1 code implementation • SIGIR 2021 • Lizi Liao, Le Hong Long, Zheng Zhang, Minlie Huang, Tat-Seng Chua

Second, a set of benchmark results for dialogue state tracking, conversational recommendation, response generation as well as a unified model for multiple tasks are reported.

Ranked #2 on Dialogue State Tracking on MMConv

Conversational Search Dialogue State Tracking +1

Paper
Code

Affective Image Content Analysis: Two Decades Review and New Perspectives

no code implementations • 30 Jun 2021 • Sicheng Zhao, Xingxu Yao, Jufeng Yang, Guoli Jia, Guiguang Ding, Tat-Seng Chua, Björn W. Schuller, Kurt Keutzer

Images can convey rich semantics and induce various emotions in viewers.

Emotional Intelligence Emotion Recognition +1

Paper
Add Code

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions

1 code implementation • CVPR 2021 • Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng Chua

We introduce NExT-QA, a rigorously designed video question answering (VideoQA) benchmark to advance video understanding from describing to explaining the temporal actions.

Question Answering Video Question Answering +2

100

Paper
Code

Empowering Language Understanding with Counterfactual Reasoning

1 code implementation • Findings (ACL) 2021 • Fuli Feng, Jizhi Zhang, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

Present language understanding methods have demonstrated extraordinary ability of recognizing patterns in texts via machine learning.

counterfactual Counterfactual Reasoning +2

Paper
Code

Deconfounded Video Moment Retrieval with Causal Intervention

1 code implementation • 3 Jun 2021 • Xun Yang, Fuli Feng, Wei Ji, Meng Wang, Tat-Seng Chua

To fill the research gap, we propose a causality-inspired VMR framework that builds structural causal model to capture the true effect of query and video content on the prediction.

Moment Retrieval Retrieval

Paper
Code

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting

1 code implementation • 25 May 2021 • Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua, Jinyoung Moon, Hong-Han Shuai

This companion paper supports the replication of the fashion trend forecasting experiments with the KERN (Knowledge Enhanced Recurrent Network) method that we presented in the ICMR 2020.

Paper
Code

Deconfounded Recommendation for Alleviating Bias Amplification

1 code implementation • 22 May 2021 • Wenjie Wang, Fuli Feng, Xiangnan He, Xiang Wang, Tat-Seng Chua

In this work, we scrutinize the cause-effect factors for bias amplification, identifying the main reason lies in the confounder effect of imbalanced item distribution on user representation and prediction score.

Fairness Recommendation Systems

Paper
Code

NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions

2 code implementations • 18 May 2021 • Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng Chua

We introduce NExT-QA, a rigorously designed video question answering (VideoQA) benchmark to advance video understanding from describing to explaining the temporal actions.

Question Answering Video Question Answering +2

100

Paper
Code

TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance

1 code implementation • ACL 2021 • Fengbin Zhu, Wenqiang Lei, Youcheng Huang, Chao Wang, Shuo Zhang, Jiancheng Lv, Fuli Feng, Tat-Seng Chua

In this work, we extract samples from real financial reports to build a new large-scale QA dataset containing both Tabular And Textual data, named TAT-QA, where numerical reasoning is usually required to infer the answer, such as addition, subtraction, multiplication, division, counting, comparison/sorting, and the compositions.

Ranked #1 on Question Answering on TAT-QA

Question Answering

Paper
Code

Leveraging Two Types of Global Graph for Sequential Fashion Recommendation

no code implementations • 17 May 2021 • Yujuan Ding, Yunshan Ma, Wai Keung Wong, Tat-Seng Chua

Sequential fashion recommendation is of great significance in online fashion shopping, which accounts for an increasing portion of either fashion retailing or online e-commerce.

Vocal Bursts Valence Prediction

Paper
Add Code

Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction

1 code implementation • NAACL 2021 • Zhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang, Tat-Seng Chua

Grammatical Error Correction (GEC) aims to correct writing errors and help language learners improve their writing skills.

Ranked #1 on Grammatical Error Detection on FCE

Grammatical Error Detection

Paper
Code

Leveraging Multiple Relations for Fashion Trend Forecasting Based on Social Media

no code implementations • 7 May 2021 • Yujuan Ding, Yunshan Ma, Lizi Liao, Wai Keung Wong, Tat-Seng Chua

Towards insightful fashion trend forecasting, previous work [1] proposed to analyze more fine-grained fashion elements which can informatively reveal fashion trends.

Time Series Analysis

Paper
Add Code

A-FMI: Learning Attributions from Deep Networks via Feature Map Importance

no code implementations • 12 Apr 2021 • An Zhang, Xiang Wang, Chengfang Fang, Jie Shi, Tat-Seng Chua, Zehua Chen

Gradient-based attribution methods can aid in the understanding of convolutional neural networks (CNNs).

Paper
Add Code

Conditional Hyper-Network for Blind Super-Resolution with Multiple Degradations

1 code implementation • 8 Apr 2021 • Guanghao Yin, Wei Wang, Zehuan Yuan, Wei Ji, Dongdong Yu, Shouqian Sun, Tat-Seng Chua, Changhu Wang

We extract degradation prior at task-level with the proposed ConditionNet, which will be used to adapt the parameters of the basic SR network (BaseNet).

Blind Super-Resolution Image Super-Resolution

Paper
Code

Learning Intents behind Interactions with Knowledge Graph for Recommendation

2 code implementations • 14 Feb 2021 • Xiang Wang, Tinglin Huang, Dingxian Wang, Yancheng Yuan, Zhenguang Liu, Xiangnan He, Tat-Seng Chua

In this study, we explore intents behind a user-item interaction by using auxiliary item knowledge, and propose a new model, Knowledge Graph-based Intent Network (KGIN).

Recommendation Systems Relation

206

Paper
Code

Advances and Challenges in Conversational Recommender Systems: A Survey

no code implementations • 23 Jan 2021 • Chongming Gao, Wenqiang Lei, Xiangnan He, Maarten de Rijke, Tat-Seng Chua

In this paper, we provide a systematic review of the techniques used in current CRSs.

Dialogue Understanding Information Retrieval +3

Paper
Add Code

Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering

no code implementations • 4 Jan 2021 • Fengbin Zhu, Wenqiang Lei, Chao Wang, Jianming Zheng, Soujanya Poria, Tat-Seng Chua

Open-domain Question Answering (OpenQA) is an important task in Natural Language Processing (NLP), which aims to answer a question in the form of natural language based on large-scale unstructured documents.

Machine Reading Comprehension Open-Domain Question Answering

Paper
Add Code

Causal Screening to Interpret Graph Neural Networks

no code implementations • 1 Jan 2021 • Xiang Wang, Yingxin Wu, An Zhang, Xiangnan He, Tat-Seng Chua

In this work, we focus on the causal interpretability in GNNs and propose a method, Causal Screening, from the perspective of cause-effect.

Explanation Generation

Paper
Add Code

Learning Relation Prototype from Unlabeled Texts for Long-tail Relation Extraction

1 code implementation • 27 Nov 2020 • Yixin Cao, Jun Kuang, Ming Gao, Aoying Zhou, Yonggang Wen, Tat-Seng Chua

In this paper, we propose a general approach to learn relation prototypesfrom unlabeled texts, to facilitate the long-tail relation extraction by transferring knowledge from the relation types with sufficient trainingdata.

Relation Relation Extraction +1

Paper
Code

Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method

1 code implementation • 22 Oct 2020 • Fuli Feng, Weiran Huang, Xiangnan He, Xin Xin, Qifan Wang, Tat-Seng Chua

To this end, we analyze the working mechanism of GCN with causal graph, estimating the causal effect of a node's local structure for the prediction.

Blocking Causal Inference +4

Paper
Code

Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment

1 code implementation • EMNLP 2020 • Yixin Cao, Liangming Pan, Juanzi Li, Zhiyuan Liu, Tat-Seng Chua

GNN-based EA methods present promising performances by modeling the KG structure defined by relation triples.

Attribute Entity Alignment

Paper
Code

Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue

1 code implementation • 21 Sep 2020 • Wenjie Wang, Fuli Feng, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

However, we argue that there is a significant gap between clicks and user satisfaction -- it is common that a user is "cheated" to click an item by the attractive title/cover of the item.

Click-Through Rate Prediction counterfactual +1

Paper
Code

Multi-modal Cooking Workflow Construction for Food Recipes

no code implementations • 20 Aug 2020 • Liangming Pan, Jingjing Chen, Jianlong Wu, Shaoteng Liu, Chong-Wah Ngo, Min-Yen Kan, Yu-Gang Jiang, Tat-Seng Chua

Understanding food recipe requires anticipating the implicit causal effects of cooking actions, such that the recipe can be converted into a graph describing the temporal workflow of the recipe.

Common Sense Reasoning Decoder

Paper
Add Code

Neural Sparse Voxel Fields

1 code implementation • NeurIPS 2020 • Lingjie Liu, Jiatao Gu, Kyaw Zaw Lin, Tat-Seng Chua, Christian Theobalt

We also demonstrate several challenging tasks, including multi-scene learning, free-viewpoint rendering of a moving human, and large-scale scene rendering.

793

Paper
Code

Visual Relation Grounding in Videos

1 code implementation • ECCV 2020 • Junbin Xiao, Xindi Shang, Xun Yang, Sheng Tang, Tat-Seng Chua

In this paper, we explore a novel task named visual Relation Grounding in Videos (vRGV).

Question Answering Relation +2

Paper
Code

Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval

no code implementations • 6 Jul 2020 • Xun Yang, Jianfeng Dong, Yixin Cao, Xun Wang, Meng Wang, Tat-Seng Chua

To facilitate video retrieval with complex queries, we propose a Tree-augmented Cross-modal Encoding method by jointly learning the linguistic structure of queries and the temporal representation of videos.

Retrieval Video Retrieval

Paper
Add Code

Disentangled Graph Collaborative Filtering

2 code implementations • 3 Jul 2020 • Xiang Wang, Hongye Jin, An Zhang, Xiangnan He, Tong Xu, Tat-Seng Chua

Such uniform approach to model user interests easily results in suboptimal representations, failing to model diverse relationships and disentangle user intents in representations.

Collaborative Filtering Disentanglement

129

Paper
Code

Interactive Path Reasoning on Graph for Conversational Recommendation

no code implementations • 1 Jul 2020 • Wenqiang Lei, Gangyi Zhang, Xiangnan He, Yisong Miao, Xiang Wang, Liang Chen, Tat-Seng Chua

Traditional recommendation systems estimate user preference on items from past interaction history, thus suffering from the limitations of obtaining fine-grained and dynamic user preference.

Attribute Recommendation Systems

Paper
Add Code

Few-shot 3D Point Cloud Semantic Segmentation

1 code implementation • CVPR 2021 • Na Zhao, Tat-Seng Chua, Gim Hee Lee

These fully supervised approaches heavily rely on large amounts of labeled training data that are difficult to obtain and cannot segment new classes after training.

Ranked #1 on Few-shot 3D Point Cloud Semantic Segmentation on S3DIS

Few-shot 3D Point Cloud Semantic Segmentation Segmentation +1

153

Paper
Code

Denoising Implicit Feedback for Recommendation

2 code implementations • 7 Jun 2020 • Wenjie Wang, Fuli Feng, Xiangnan He, Liqiang Nie, Tat-Seng Chua

In this work, we explore the central theme of denoising implicit feedback for recommender training.

Denoising Recommendation Systems

Paper
Code

Rethinking Dialogue State Tracking with Reasoning

no code implementations • 27 May 2020 • Lizi Liao, Yunshan Ma, Wenqiang Lei, Tat-Seng Chua

Tracking dialogue states to better interpret user goals and feed downstream policy learning is a bottleneck in dialogue management.

Dialogue Management Dialogue State Tracking +1

Paper
Add Code

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation

1 code implementation • 26 May 2020 • Xingchen Li, Xiang Wang, Xiangnan He, Long Chen, Jun Xiao, Tat-Seng Chua

Fashion outfit recommendation has attracted increasing attentions from online shopping services and fashion communities. Distinct from other scenarios (e. g., social networking or content sharing) which recommend a single item (e. g., a friend or picture) to a user, outfit recommendation predicts user preference on a set of well-matched fashion items. Hence, performing high-quality personalized outfit recommendation should satisfy two requirements -- 1) the nice compatibility of fashion items and 2) the consistence with user preference.

Paper
Code

Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users

1 code implementation • 23 May 2020 • Shijun Li, Wenqiang Lei, Qingyun Wu, Xiangnan He, Peng Jiang, Tat-Seng Chua

In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively.

Collaborative Filtering Thompson Sampling

Paper
Code

Knowledge Enhanced Neural Fashion Trend Forecasting

1 code implementation • 7 May 2020 • Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua

Further-more, to effectively model the time series data of fashion elements with rather complex patterns, we propose a Knowledge EnhancedRecurrent Network model (KERN) which takes advantage of the capability of deep recurrent neural networks in modeling time-series data.

Time Series Time Series Analysis

Paper
Code

Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen

no code implementations • ACL 2020 • Yixin Cao, Ruihao Shui, Liangming Pan, Min-Yen Kan, Zhiyuan Liu, Tat-Seng Chua

The curse of knowledge can impede communication between experts and laymen.

Style Transfer Text Simplification

Paper
Add Code

Semantic Graphs for Generating Deep Questions

1 code implementation • ACL 2020 • Liangming Pan, Yuxi Xie, Yansong Feng, Tat-Seng Chua, Min-Yen Kan

This paper proposes the problem of Deep Question Generation (DQG), which aims to generate complex questions that require reasoning over multiple pieces of information of the input passage.

Question Generation Question-Generation

Paper
Code

Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Zheng Zhang, Lizi Liao, Xiaoyan Zhu, Tat-Seng Chua, Zitao Liu, Yan Huang, Minlie Huang

Most existing approaches for goal-oriented dialogue policy learning used reinforcement learning, which focuses on the target agent policy and simply treat the opposite agent policy as part of the environment.

Decision Making

Paper
Add Code

Reinforced Negative Sampling over Knowledge Graph for Recommendation

1 code implementation • 12 Mar 2020 • Xiang Wang, Yaokun Xu, Xiangnan He, Yixin Cao, Meng Wang, Tat-Seng Chua

Properly handling missing data is a fundamental challenge in recommendation.

131

Paper
Code

Cross-GCN: Enhancing Graph Convolutional Network with $k$-Order Feature Interactions

no code implementations • 5 Mar 2020 • Fuli Feng, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

Graph Convolutional Network (GCN) is an emerging technique that performs learning and reasoning on graph data.

Document Classification

Paper
Add Code

Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems

no code implementations • 21 Feb 2020 • Wenqiang Lei, Xiangnan He, Yisong Miao, Qingyun Wu, Richang Hong, Min-Yen Kan, Tat-Seng Chua

Recommender systems are embracing conversational technologies to obtain user preferences dynamically, and to overcome inherent limitations of their static models.

Recommendation Systems

Paper
Add Code

Multi-source Domain Adaptation for Visual Sentiment Classification

no code implementations • 12 Jan 2020 • Chuang Lin, Sicheng Zhao, Lei Meng, Tat-Seng Chua

Existing domain adaptation methods on visual sentiment classification typically are investigated under the single-source scenario, where the knowledge learned from a source domain of sufficient labeled data is transferred to the target domain of loosely labeled or unlabeled data.

Classification Domain Adaptation +4

Paper
Add Code

SESS: Self-Ensembling Semi-Supervised 3D Object Detection

1 code implementation • CVPR 2020 • Na Zhao, Tat-Seng Chua, Gim Hee Lee

The performance of existing point cloud-based 3D object detection methods heavily relies on large-scale high-quality 3D annotations.

3D Object Detection Object +2

134

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.