Recent Articles

Open access

ISSN: 2666-6510

Joint span and token framework for few-shot named entity recognition

Few-shot Named Entity Recognition (NER) is a challenging task that involves identifying new entity types using a limited number of labeled instances for training. Currently, the majority of Few-shot...

Share article

MOTT: A new model for multi-object tracking based on green learning paradigm

Multi-object tracking (MOT) is one of the most essential and challenging tasks in computer vision (CV). Unlike object detectors, MOT systems nowadays are more complicated and consist of several neural...

Share article

Multi-grained hypergraph interest modeling for conversational recommendation

Conversational recommender system (CRS) interacts with users through multi-turn dialogues in natural language, which aims to provide high-quality recommendations for user’s instant information need....

Share article

A unified network embedding algorithm for multi-type similarity measures

Traditional network embedding aims to learn representations by capturing a predefined vertex-to-vertex similarity measure. However, in practice, there are different types of similarity measures (e.g.,...

Share article

Batch virtual adversarial training for graph convolutional networks

We present batch virtual adversarial training (BVAT), a novel regularization method for graph convolutional networks (GCNs). BVAT addresses the issue that GCNs do not ensure the smoothness of the model’s...

Share article

AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation

Knowledge distillation (KD) is a widely used method for transferring knowledge from large teacher models to computationally efficient student models. Unfortunately, the computational cost of KD becomes...

Share article

A survey on complex factual question answering

Answering complex factual questions has drawn a lot of attention. Researchers leverage various data sources to support complex QA, such as unstructured texts, structured knowledge graphs and relational...

Share article

Language as a latent sequence: Deep latent variable models for semi-supervised paraphrase generation

This paper explores deep latent variable models for semi-supervised paraphrase generation, where the missing target pair for unlabelled data is modelled as a latent paraphrase sequence. We present a...

Share article

Learning fair representations via an adversarial framework

Fairness has become a central issue for our research community as classification algorithms are adopted in societally critical domains such as recidivism prediction and loan approval. In this work,...

Share article

Is Chinese Spelling Check ready? Understanding the correction behavior in real-world scenarios

The task of Chinese Spelling Check (CSC) is crucial for identifying and rectifying spelling errors in Chinese texts. While prior work in this domain has predominantly relied on benchmarks such as SIGHAN...

Share article

Semantic graph based topic modelling framework for multilingual fake news detection

Fake news detection is one of the most alluring problems that has grabbed the interest of Machine Learning (ML) and Natural Language Processing (NLP) experts in recent years. The majority of existing...

Share article

Word sense induction with agglomerative clustering and mutual information maximization

Word sense induction (WSI) is a challenging problem in natural language processing that involves the unsupervised automatic detection of a word’s senses (i.e., meanings). Recent work achieves significant...

Share article

Sarcasm detection using news headlines dataset

Sarcasm has been an elusive concept for humans. Due to interesting linguistic properties, sarcasm detection has gained traction of the Natural Language Processing (NLP) research community in the past...

Share article

Graph-based methods for cervical cancer segmentation: Advancements, limitations, and future directions

Cervical cancer remains a significant health concern worldwide, where precise segmentation of cervical lesions is integral for effective diagnosis and treatment planning. This systematic review critically...

Share article

MONEY: Ensemble learning for stock price movement prediction via a convolutional network with adversarial hypergraph model

Stock price prediction is challenging in financial investment, with the AI boom leading to increased interest from researchers. Despite these recent advances, many studies are limited to capturing the...

Share article

Interactive active learning for fairness with partial group label

The rapid development of AI technologies has found numerous applications across various domains in human society. Ensuring fairness and preventing discrimination are critical considerations in the development...

Share article

Restricted orthogonal gradient projection for continual learning

Continual learning aims to avoid catastrophic forgetting and effectively leverage learned experiences to master new knowledge. Existing gradient projection approaches impose hard constraints on the...

Share article

UPRec: User-aware Pre-training for sequential Recommendation

Recent years witness the success of pre-trained models to alleviate the data sparsity problem in recommender systems. However, existing pre-trained models for recommendation mainly focus on leveraging...

Share article

CAILIE 1.0: A dataset for Challenge of AI in Law - Information Extraction V1.0

Legal information extraction requires identifying and classifying legal elements from specific legal documents. Considering that information extraction is mainly regarded as the first step in natural...

Share article

HSSDA: Hierarchical relation aided Semi-Supervised Domain Adaptation

The mainstream domain adaptation (DA) methods transfer the supervised source domain knowledge to the unsupervised or semi-supervised target domain, so as to assist the classification task in the target...

Share article

Optimized separable convolution: Yet another efficient convolution operator

The convolution operation is the most critical component in recent surge of deep learning research. Conventional 2D convolution needs O(C2K2) parameters to represent, where C is the channel size and...

Share article

Stay Informed

Register your interest and receive email alerts tailored to your needs. Sign up below.