Skip to main content

Showing 1–50 of 65 results for author: Nie, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04963  [pdf, other

    cs.LG cs.AI

    Learning Divergence Fields for Shift-Robust Graph Representations

    Authors: Qitian Wu, Fan Nie, Chenxiao Yang, Junchi Yan

    Abstract: Real-world data generation often involves certain geometries (e.g., graphs) that induce instance-level interdependence. This characteristic makes the generalization of learning models more difficult due to the intricate interdependent patterns that impact data-generative distributions and can vary from training to testing. In this work, we propose a geometric diffusion model with learnable diverge… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024. Source codes at https://github.com/fannie1208/GLIND

  2. arXiv:2405.13427  [pdf, ps, other

    cs.LG

    Adaptive Fuzzy C-Means with Graph Embedding

    Authors: Qiang Chen, Weizhong Yu, Feiping Nie, Xuelong Li

    Abstract: Fuzzy clustering algorithms can be roughly categorized into two main groups: Fuzzy C-Means (FCM) based methods and mixture model based methods. However, for almost all existing FCM based methods, how to automatically selecting proper membership degree hyper-parameter values remains a challenging and unsolved problem. Mixture model based methods, while circumventing the difficulty of manually adjus… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.16616  [pdf, other

    cs.LG

    Robust Capped lp-Norm Support Vector Ordinal Regression

    Authors: Haorui Xiang, Zhichang Wu, Guoxu Li, Rong Wang, Feiping Nie, Xuelong Li

    Abstract: Ordinal regression is a specialized supervised problem where the labels show an inherent order. The order distinguishes it from normal multi-class problem. Support Vector Ordinal Regression, as an outstanding ordinal regression model, is widely used in many ordinal regression tasks. However, like most supervised learning algorithms, the design of SVOR is based on the assumption that the training d… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  4. arXiv:2404.00044  [pdf, other

    physics.chem-ph cs.AI cs.LG q-bio.QM

    UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment

    Authors: Kaipeng Zeng, Bo yang, Xin Zhao, Yu Zhang, Fan Nie, Xiaokang Yang, Yaohui Jin, Yanyan Xu

    Abstract: Motivation: Retrosynthesis planning poses a formidable challenge in the organic chemical industry. Single-step retrosynthesis prediction, a crucial step in the planning process, has witnessed a surge in interest in recent years due to advancements in AI for science. Various deep learning-based methods have been proposed for this task in recent years, incorporating diverse levels of additional chem… ▽ More

    Submitted 19 April, 2024; v1 submitted 24 March, 2024; originally announced April 2024.

  5. arXiv:2403.05014  [pdf, other

    cs.LG cs.AI

    Simple Multigraph Convolution Networks

    Authors: Danyang Wu, Xinjie Shen, Jitao Lu, Jin Xu, Feiping Nie

    Abstract: Existing multigraph convolution methods either ignore the cross-view interaction among multiple graphs, or induce extremely high computational cost due to standard cross-view polynomial operators. To alleviate this problem, this paper proposes a Simple MultiGraph Convolution Networks (SMGCN) which first extracts consistent cross-view topology from multigraphs including edge-level and subgraph-leve… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024 Short

  6. arXiv:2403.00307  [pdf, other

    cs.CV cs.AI

    Embedded Multi-label Feature Selection via Orthogonal Regression

    Authors: Xueyuan Xu, Fulin Wei, Tianyuan Jia, Li Zhuo, Feiping Nie, Xia Wu

    Abstract: In the last decade, embedded multi-label feature selection methods, incorporating the search for feature subsets into model optimization, have attracted considerable attention in accurately evaluating the importance of features in multi-label classification tasks. Nevertheless, the state-of-the-art embedded multi-label feature selection algorithms based on least square regression usually cannot pr… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  7. arXiv:2402.11494  [pdf, other

    cs.LG cs.SI

    Graph Out-of-Distribution Generalization via Causal Intervention

    Authors: Qitian Wu, Fan Nie, Chenxiao Yang, Tianyi Bao, Junchi Yan

    Abstract: Out-of-distribution (OOD) generalization has gained increasing attentions for learning on graphs, as graph neural networks (GNNs) often exhibit performance degradation with distribution shifts. The challenge is that distribution shifts on graphs involve intricate interconnections between nodes, and the environment labels are often absent in data. In this paper, we adopt a bottom-up data-generative… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by the research paper track of The Web Conference (WWW) 2024. The codes are available at https://github.com/fannie1208/CaNet

  8. arXiv:2312.06578  [pdf, other

    cs.LG

    Multi-class Support Vector Machine with Maximizing Minimum Margin

    Authors: Feiping Nie, Zhezheng Hao, Rong Wang

    Abstract: Support Vector Machine (SVM) stands out as a prominent machine learning technique widely applied in practical pattern recognition tasks. It achieves binary classification by maximizing the "margin", which represents the minimum distance between instances and the decision boundary. Although many efforts have been dedicated to expanding SVM for multi-class case through strategies such as one versus… ▽ More

    Submitted 14 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  9. A Novel Normalized-Cut Solver with Nearest Neighbor Hierarchical Initialization

    Authors: Feiping Nie, Jitao Lu, Danyang Wu, Rong Wang, Xuelong Li

    Abstract: Normalized-Cut (N-Cut) is a famous model of spectral clustering. The traditional N-Cut solvers are two-stage: 1) calculating the continuous spectral embedding of normalized Laplacian matrix; 2) discretization via $K$-means or spectral rotation. However, this paradigm brings two vital problems: 1) two-stage methods solve a relaxed version of the original problem, so they cannot obtain good solution… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  10. arXiv:2310.06417  [pdf, other

    cs.LG cs.AI

    Advective Diffusion Transformers for Topological Generalization in Graph Learning

    Authors: Qitian Wu, Chenxiao Yang, Kaipeng Zeng, Fan Nie, Michael Bronstein, Junchi Yan

    Abstract: Graph diffusion equations are intimately related to graph neural networks (GNNs) and have recently attracted attention as a principled framework for analyzing GNN dynamics, formalizing their expressive power, and justifying architectural choices. One key open questions in graph learning is the generalization capabilities of GNNs. A major limitation of current approaches hinges on the assumption th… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 39 pages

  11. arXiv:2310.01098  [pdf, other

    cs.LG cs.AI cs.SI

    NP$^2$L: Negative Pseudo Partial Labels Extraction for Graph Neural Networks

    Authors: Xinjie Shen, Danyang Wu, Jitao Lu, Junjie Liang, Jin Xu, Feiping Nie

    Abstract: How to utilize the pseudo labels has always been a research hotspot in machine learning. However, most methods use pseudo labels as supervised training, and lack of valid assessing for their accuracy. Moreover, applications of pseudo labels in graph neural networks (GNNs) oversee the difference between graph learning and other machine learning tasks such as message passing mechanism. Aiming to add… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  12. arXiv:2309.16397  [pdf, other

    cs.LG cs.AI

    Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

    Authors: Zenan Li, Fan Nie, Qiao Sun, Fang Da, Hang Zhao

    Abstract: Offline Reinforcement Learning (RL) has emerged as a promising framework for learning policies without active interactions, making it especially appealing for autonomous driving tasks. Recent successes of Transformers inspire casting offline RL as sequence modeling, which performs well in long-horizon tasks. However, they are overly optimistic in stochastic environments with incorrect assumptions… ▽ More

    Submitted 17 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

  13. arXiv:2309.13614  [pdf, other

    cs.RO cs.AI

    Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills

    Authors: Zenan Li, Fan Nie, Qiao Sun, Fang Da, Hang Zhao

    Abstract: Learning-based vehicle planning is receiving increasing attention with the emergence of diverse driving simulators and large-scale driving datasets. While offline reinforcement learning (RL) is well suited for these safety-critical tasks, it still struggles to plan over extended periods. In this work, we present a skill-based framework that enhances offline RL to overcome the long-horizon vehicle… ▽ More

    Submitted 17 November, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

  14. arXiv:2306.10759  [pdf, other

    cs.LG cs.AI cs.SI

    SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations

    Authors: Qitian Wu, Wentao Zhao, Chenxiao Yang, Hengrui Zhang, Fan Nie, Haitian Jiang, Yatao Bian, Junchi Yan

    Abstract: Learning representations on large-sized graphs is a long-standing challenge due to the inter-dependence nature involved in massive data points. Transformers, as an emerging class of foundation encoders for graph-structured data, have shown promising performance on small graphs due to its global attention capable of capturing all-pair influence beyond neighboring nodes. Even so, existing approaches… ▽ More

    Submitted 4 January, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023, the codes are available at https://github.com/qitianwu/SGFormer

  15. arXiv:2305.12396  [pdf, other

    cs.LG

    Joint Feature and Differentiable $ k $-NN Graph Learning using Dirichlet Energy

    Authors: Lei Xu, Lei Chen, Rong Wang, Feiping Nie, Xuelong Li

    Abstract: Feature selection (FS) plays an important role in machine learning, which extracts important features and accelerates the learning process. In this paper, we propose a deep FS method that simultaneously conducts feature selection and differentiable $ k $-NN graph learning based on the Dirichlet Energy. The Dirichlet Energy identifies important features by measuring their smoothness on the graph st… ▽ More

    Submitted 9 November, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by NeurIPS 2023

  16. arXiv:2305.07521  [pdf, other

    cs.LG

    AGFormer: Efficient Graph Representation with Anchor-Graph Transformer

    Authors: Bo Jiang, Fei Xu, Ziyan Zhang, Jin Tang, Feiping Nie

    Abstract: To alleviate the local receptive issue of GCN, Transformers have been exploited to capture the long range dependences of nodes for graph data representation and learning. However, existing graph Transformers generally employ regular self-attention module for all node-to-node message passing which needs to learn the affinities/relationships between all node's pairs, leading to high computational co… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  17. arXiv:2212.03589  [pdf, other

    cs.LG

    On the Global Solution of Soft k-Means

    Authors: Feiping Nie, Hong Chen, Rong Wang, Xuelong Li

    Abstract: This paper presents an algorithm to solve the Soft k-Means problem globally. Unlike Fuzzy c-Means, Soft k-Means (SkM) has a matrix factorization-type objective and has been shown to have a close relation with the popular probability decomposition-type clustering methods, e.g., Left Stochastic Clustering (LSC). Though some work has been done for solving the Soft k-Means problem, they usually use an… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  18. arXiv:2212.02216  [pdf, other

    cs.CL

    Improving Few-Shot Performance of Language Models via Nearest Neighbor Calibration

    Authors: Feng Nie, Meixi Chen, Zhirui Zhang, Xu Cheng

    Abstract: Pre-trained language models (PLMs) have exhibited remarkable few-shot learning capabilities when provided a few examples in a natural language prompt as demonstrations of test instances, i.e., in-context learning. However, the performance of in-context learning is susceptible to the choice of prompt format, training examples and the ordering of the training examples. In this paper, we propose a no… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Work in progress

  19. arXiv:2211.02883  [pdf, other

    cs.CV

    Unified Multi-View Orthonormal Non-Negative Graph Based Clustering Framework

    Authors: Liangchen Liu, Qiuhong Ke, Chaojie Li, Feiping Nie, Yingying Zhu

    Abstract: Spectral clustering is an effective methodology for unsupervised learning. Most traditional spectral clustering algorithms involve a separate two-step procedure and apply the transformed new representations for the final clustering results. Recently, much progress has been made to utilize the non-negative feature property in real-world data and to jointly learn the representation and clustering re… ▽ More

    Submitted 1 December, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

  20. arXiv:2204.07741  [pdf, other

    cs.HC cs.CL cs.LG

    Persua: A Visual Interactive System to Enhance the Persuasiveness of Arguments in Online Discussion

    Authors: Meng Xia, Qian Zhu, Xingbo Wang, Fei Nie, Huamin Qu, Xiaojuan Ma

    Abstract: Persuading people to change their opinions is a common practice in online discussion forums on topics ranging from political campaigns to relationship consultation. Enhancing people's ability to write persuasive arguments could not only practice their critical thinking and reasoning but also contribute to the effectiveness and civility in online communication. It is, however, not an easy task in o… ▽ More

    Submitted 21 April, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: This paper will appear in CSCW 2022

    Journal ref: Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Article 319 (November 2022)

  21. arXiv:2204.04859  [pdf, other

    cs.CL cs.LG

    A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges

    Authors: Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen

    Abstract: Legal judgment prediction (LJP) applies Natural Language Processing (NLP) techniques to predict judgment results based on fact descriptions automatically. Recently, large-scale public datasets and advances in NLP research have led to increasing interest in LJP. Despite a clear gap between machine and human performance, impressive results have been achieved in various benchmark datasets. In this pa… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 25 pages, 6 figures and 12 tables

  22. arXiv:2201.13194   

    cs.LG

    Compactness Score: A Fast Filter Method for Unsupervised Feature Selection

    Authors: Peican Zhu, Xin Hou, Keke Tang, Zhen Wang, Feiping Nie

    Abstract: Along with the flourish of the information age, massive amounts of data are generated day by day. Due to the large-scale and high-dimensional characteristics of these data, it is often difficult to achieve better decision-making in practical applications. Therefore, an efficient big data analytics method is urgently needed. For feature engineering, feature selection seems to be an important resear… ▽ More

    Submitted 3 April, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Since the experimental section in the current version is not complete, it needs to be withdrawn and resubmitted

  23. Adaptive neighborhood Metric learning

    Authors: Kun Song, Junwei Han, Gong Cheng, Jiwen Lu, Feiping Nie

    Abstract: In this paper, we reveal that metric learning would suffer from serious inseparable problem if without informative sample mining. Since the inseparable samples are often mixed with hard samples, current informative sample mining strategies used to deal with inseparable problem may bring up some side-effects, such as instability of objective function, etc. To alleviate this problem, we propose a no… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 14 pages, 6 figures

  24. arXiv:2112.04734  [pdf, other

    cs.LG

    New Tight Relaxations of Rank Minimization for Multi-Task Learning

    Authors: Wei Chang, Feiping Nie, Rong Wang, Xuelong Li

    Abstract: Multi-task learning has been observed by many researchers, which supposes that different tasks can share a low-rank common yet latent subspace. It means learning multiple tasks jointly is better than learning them independently. In this paper, we propose two novel multi-task learning formulations based on two regularization terms, which can learn the optimal shared latent subspace by minimizing th… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  25. arXiv:2101.01976  [pdf, other

    eess.IV cs.CV

    Ensemble and Random Collaborative Representation-Based Anomaly Detector for Hyperspectral Imagery

    Authors: Rong Wang, Yihang Lu, Qianrong Zhang, Feiping Nie, Zhen Wang, Xuelong Li

    Abstract: In recent years, hyperspectral anomaly detection (HAD) has become an active topic and plays a significant role in military and civilian fields. As a classic HAD method, the collaboration representation-based detector (CRD) has attracted extensive attention and in-depth research. Despite the good performance of the CRD method, its computational cost mainly arising from the sliding dual window strat… ▽ More

    Submitted 22 December, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

    Comments: 11 pages, 8 figures

  26. arXiv:2012.14595  [pdf, other

    cs.LG stat.ML

    Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature Selection

    Authors: Zhengxin Li, Feiping Nie, Jintang Bian, Xuelong Li

    Abstract: In the field of data mining, how to deal with high-dimensional data is an inevitable problem. Unsupervised feature selection has attracted more and more attention because it does not rely on labels. The performance of spectral-based unsupervised methods depends on the quality of constructed similarity matrix, which is used to depict the intrinsic structure of data. However, real-world data contain… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

  27. arXiv:2010.03084  [pdf, other

    cs.AI cs.CL cs.LG

    Program Enhanced Fact Verification with Verbalization and Graph Attention Network

    Authors: Xiaoyu Yang, Feng Nie, Yufei Feng, Quan Liu, Zhigang Chen, Xiaodan Zhu

    Abstract: Performing fact verification based on structured data is important for many real-life applications and is a challenging research problem, particularly when it involves both symbolic operations and informal inference based on language understanding. In this paper, we present a Program-enhanced Verbalization and Graph Attention Network (ProgVGAT) to integrate programs and execution into textual infe… ▽ More

    Submitted 11 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 16 pages (Accepted by EMNLP 2020 as a long paper)

  28. arXiv:2009.12362  [pdf, other

    cs.LG stat.ML

    Self-Weighted Robust LDA for Multiclass Classification with Edge Classes

    Authors: Caixia Yan, Xiaojun Chang, Minnan Luo, Qinghua Zheng, Xiaoqin Zhang, Zhihui Li, Feiping Nie

    Abstract: Linear discriminant analysis (LDA) is a popular technique to learn the most discriminative features for multi-class classification. A vast majority of existing LDA algorithms are prone to be dominated by the class with very large deviation from the others, i.e., edge class, which occurs frequently in multi-class classification. First, the existence of edge classes often makes the total mean biased… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 17 pages, has been accepted by ACM TIST

  29. arXiv:2007.00767  [pdf, other

    cs.LG cs.CV stat.ML

    NP-PROV: Neural Processes with Position-Relevant-Only Variances

    Authors: Xuesong Wang, Lina Yao, Xianzhi Wang, Feiping Nie

    Abstract: Neural Processes (NPs) families encode distributions over functions to a latent representation, given context data, and decode posterior mean and variance at unknown locations. Since mean and variance are derived from the same latent space, they may fail on out-of-domain tasks where fluctuations in function values amplify the model uncertainty. We present a new member named Neural Processes with P… ▽ More

    Submitted 15 June, 2020; originally announced July 2020.

    Comments: 10 pages, 5 figures

  30. arXiv:2005.05556  [pdf, other

    cs.LG stat.ML

    Agglomerative Neural Networks for Multi-view Clustering

    Authors: Zhe Liu, Yun Li, Lina Yao, Xianzhi Wang, Feiping Nie

    Abstract: Conventional multi-view clustering methods seek for a view consensus through minimizing the pairwise discrepancy between the consensus and subviews. However, the pairwise comparison cannot portray the inter-view relationship precisely if some of the subviews can be further agglomerated. To address the above challenge, we propose the agglomerative analysis to approximate the optimal consensus view,… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  31. arXiv:2001.09414  [pdf, other

    cs.CV

    Curriculum Audiovisual Learning

    Authors: Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou

    Abstract: Associating sound and its producer in complex audiovisual scene is a challenging task, especially when we are lack of annotated training data. In this paper, we present a flexible audiovisual model that introduces a soft-clustering module as the audio and visual content detector, and regards the pervasive property of audiovisual concurrency as the latent supervision for inferring the correlation a… ▽ More

    Submitted 26 January, 2020; originally announced January 2020.

  32. arXiv:1910.03787  [pdf

    cs.LG stat.ML

    Supervised feature selection with orthogonal regression and feature weighting

    Authors: Xia Wu, Xueyuan Xu, Jianhong Liu, Hailing Wang, Bin Hu, Feiping Nie

    Abstract: Effective features can improve the performance of a model, which can thus help us understand the characteristics and underlying structure of complex data. Previous feature selection methods usually cannot keep more local structure information. To address the defects previously mentioned, we propose a novel supervised orthogonal least square regression model with feature weighting for feature selec… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  33. arXiv:1908.06699  [pdf, other

    cs.LG stat.ML

    Robust and Efficient Fuzzy C-Means Clustering Constrained on Flexible Sparsity

    Authors: Jinglin Xu, Junwei Han, Mingliang Xu, Feiping Nie, Xuelong Li

    Abstract: Clustering is an effective technique in data mining to group a set of objects in terms of some attributes. Among various clustering approaches, the family of K-Means algorithms gains popularity due to simplicity and efficiency. However, most of existing K-Means based clustering algorithms cannot deal with outliers well and are difficult to efficiently solve the problem embedded the $L_0$-norm cons… ▽ More

    Submitted 4 September, 2019; v1 submitted 19 August, 2019; originally announced August 2019.

  34. arXiv:1907.01121  [pdf, ps, other

    cs.LG stat.ML

    An Iteratively Re-weighted Method for Problems with Sparsity-Inducing Norms

    Authors: Feiping Nie, Zhanxuan Hu, Xiaoqian Wang, Rong Wang, Xuelong Li, Heng Huang

    Abstract: This work aims at solving the problems with intractable sparsity-inducing norms that are often encountered in various machine learning tasks, such as multi-task learning, subspace clustering, feature selection, robust principal component analysis, and so on. Specifically, an Iteratively Re-Weighted method (IRW) with solid convergence guarantee is provided. We investigate its convergence speed via… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 11 pages, 3 figures

  35. arXiv:1907.00211  [pdf, other

    cs.LG cs.CV stat.ML

    Robust Linear Discriminant Analysis Using Ratio Minimization of L1,2-Norms

    Authors: Feiping Nie, Hua Wang, Zheng Wang, Heng Huang

    Abstract: As one of the most popular linear subspace learning methods, the Linear Discriminant Analysis (LDA) method has been widely studied in machine learning community and applied to many scientific applications. Traditional LDA minimizes the ratio of squared L2-norms, which is sensitive to outliers. In recent research, many L1-norm based robust Principle Component Analysis methods were proposed to impro… ▽ More

    Submitted 29 June, 2019; originally announced July 2019.

  36. arXiv:1906.08905  [pdf, other

    cs.LG stat.ML

    Intrinsic Weight Learning Approach for Multi-view Clustering

    Authors: Feiping Nie, Jing Li, Xuelong Li

    Abstract: Exploiting different representations, or views, of the same object for better clustering has become very popular these days, which is conventionally called multi-view clustering. Generally, it is essential to measure the importance of each individual view, due to some noises, or inherent capacities in description. Many previous works model the view importance as weight, which is simple but effecti… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

  37. arXiv:1904.10155  [pdf, other

    cs.LG stat.ML

    Learning Feature Sparse Principal Components

    Authors: Lai Tian, Feiping Nie, Xuelong Li

    Abstract: This paper presents new algorithms to solve the feature-sparsity constrained PCA problem (FSPCA), which performs feature selection and PCA simultaneously. Existing optimization methods for FSPCA require data distribution assumptions and are lack of global convergence guarantee. Though the general FSPCA problem is NP-hard, we show that, for a low-rank covariance, FSPCA can be solved globally (Algor… ▽ More

    Submitted 25 May, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

  38. arXiv:1904.09115  [pdf, other

    cs.CV cs.HC cs.MM cs.SD eess.AS

    Listen to the Image

    Authors: Di Hu, Dong Wang, Xuelong Li, Feiping Nie, Qi Wang

    Abstract: Visual-to-auditory sensory substitution devices can assist the blind in sensing the visual environment by translating the visual information into a sound pattern. To improve the translation quality, the task performances of the blind are usually employed to evaluate different encoding schemes. In contrast to the toilsome human-based assessment, we argue that machine model can be also developed for… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: Accepted by CVPR2019

  39. Feature Learning Viewpoint of AdaBoost and a New Algorithm

    Authors: Fei Wang, Zhongheng Li, Fang He, Rong Wang, Weizhong Yu, Feiping Nie

    Abstract: The AdaBoost algorithm has the superiority of resisting overfitting. Understanding the mysteries of this phenomena is a very fascinating fundamental theoretical problem. Many studies are devoted to explaining it from statistical view and margin theory. In this paper, we illustrate it from feature learning viewpoint, and propose the AdaBoost+SVM algorithm, which can explain the resistant to overfit… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Journal ref: IEEE Access, vol. 7, pp. 149890-149899, 2019

  40. arXiv:1810.03414  [pdf, other

    cs.CV cs.MM

    Dense Multimodal Fusion for Hierarchically Joint Representation

    Authors: Di Hu, Feiping Nie, Xuelong Li

    Abstract: Multiple modalities can provide more valuable information than single one by describing the same contents in various ways. Hence, it is highly expected to learn effective joint representation by fusing the features of different modalities. However, previous methods mainly focus on fusing the shallow features or high-level representations generated by unimodal deep networks, which only capture part… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 10 pages, 4 figures

  41. arXiv:1810.03402  [pdf, other

    cs.CV cs.MM

    Deep LDA Hashing

    Authors: Di Hu, Feiping Nie, Xuelong Li

    Abstract: The conventional supervised hashing methods based on classification do not entirely meet the requirements of hashing technique, but Linear Discriminant Analysis (LDA) does. In this paper, we propose to perform a revised LDA objective over deep networks to learn efficient hashing codes in a truly end-to-end fashion. However, the complicated eigenvalue decomposition within each mini-batch in every e… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 10 pages, 3 figures

  42. arXiv:1809.02735  [pdf, other

    cs.CL cs.AI

    Operations Guided Neural Networks for High Fidelity Data-To-Text Generation

    Authors: Feng Nie, Jinpeng Wang, Jin-Ge Yao, Rong Pan, Chin-Yew Lin

    Abstract: Recent neural models for data-to-text generation are mostly based on data-driven end-to-end training over encoder-decoder networks. Even though the generated texts are mostly fluent and informative, they often generate descriptions that are not consistent with the input structured data. This is a critical issue especially in domains that require inference or calculations over raw data. In this pap… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  43. arXiv:1808.05306   

    cs.CL

    Incorporating Consistency Verification into Neural Data-to-Document Generation

    Authors: Feng Nie, Hailin Chen, Jinpeng Wang, Jin-Ge Yao, Chin-Yew Lin, Rong Pan

    Abstract: Recent neural models for data-to-document generation have achieved remarkable progress in producing fluent and informative texts. However, large proportions of generated texts do not actually conform to the input data. To address this issue, we propose a new training framework which attempts to verify the consistency between the generated texts and the input data to guide the training process. To… ▽ More

    Submitted 17 August, 2018; v1 submitted 15 August, 2018; originally announced August 2018.

    Comments: Withdraw due to unqualified content and opinions of other authors; this work is not yet qualified for a conference submission

  44. arXiv:1808.04521  [pdf, other

    cs.CV cs.LG

    Low Rank Regularization: A Review

    Authors: Zhanxuan Hu, Feiping Nie, Rong Wang, Xuelong Li

    Abstract: Low rank regularization, in essence, involves introducing a low rank or approximately low rank assumption for matrix we aim to learn, which has achieved great success in many fields including machine learning, data mining and computer version. Over the last decade, much progress has been made in theories and practical applications. Nevertheless, the intersection between them is very slight. In ord… ▽ More

    Submitted 9 December, 2020; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: 16 pages,4 figures,4 tables

  45. arXiv:1807.03094  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Deep Multimodal Clustering for Unsupervised Audiovisual Learning

    Authors: Di Hu, Feiping Nie, Xuelong Li

    Abstract: The seen birds twitter, the running cars accompany with noise, etc. These naturally audiovisual correspondences provide the possibilities to explore and understand the outside world. However, the mixed multiple objects and sounds make it intractable to perform efficient matching in the unconstrained environment. To settle this problem, we propose to adequately excavate audio and visual components… ▽ More

    Submitted 19 April, 2019; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: Accepted by CVPR2019

  46. arXiv:1804.05090  [pdf, ps, other

    cs.LG cs.IR stat.ML

    Regularized Singular Value Decomposition and Application to Recommender System

    Authors: Shuai Zheng, Chris Ding, Feiping Nie

    Abstract: Singular value decomposition (SVD) is the mathematical basis of principal component analysis (PCA). Together, SVD and PCA are one of the most widely used mathematical formalism/decomposition in machine learning, data mining, pattern recognition, artificial intelligence, computer vision, signal processing, etc. In recent applications, regularization becomes an increasing trend. In this paper, we pr… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

  47. Ranking with Adaptive Neighbors

    Authors: Muge Li, Liangyue Li, Feiping Nie

    Abstract: Retrieving the most similar objects in a large-scale database for a given query is a fundamental building block in many application domains, ranging from web searches, visual, cross media, and document retrievals. State-of-the-art approaches have mainly focused on capturing the underlying geometry of the data manifolds. Graph-based approaches, in particular, define various diffusion processes on w… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.

    Comments: published at Tsinghua Science and Technology 22(6), 2017

  48. arXiv:1709.02896  [pdf, ps, other

    cs.CV cs.LG stat.ML

    Simultaneously Learning Neighborship and Projection Matrix for Supervised Dimensionality Reduction

    Authors: Yanwei Pang, Bo Zhou, Feiping Nie

    Abstract: Explicitly or implicitly, most of dimensionality reduction methods need to determine which samples are neighbors and the similarity between the neighbors in the original highdimensional space. The projection matrix is then learned on the assumption that the neighborhood information (e.g., the similarity) is known and fixed prior to learning. However, it is difficult to precisely measure the intrin… ▽ More

    Submitted 8 September, 2017; originally announced September 2017.

  49. Deep Binary Reconstruction for Cross-modal Hashing

    Authors: Xuelong Li, Di Hu, Feiping Nie

    Abstract: With the increasing demand of massive multimodal data storage and organization, cross-modal retrieval based on hashing technique has drawn much attention nowadays. It takes the binary codes of one modality as the query to retrieve the relevant hashing codes of another modality. However, the existing binary constraint makes it difficult to find the optimal cross-modal hashing function. Most approac… ▽ More

    Submitted 23 August, 2017; v1 submitted 17 August, 2017; originally announced August 2017.

    Comments: 8 pages, 5 figures, accepted by ACM Multimedia 2017

  50. arXiv:1705.01206  [pdf, other

    cs.LG

    Local Shrunk Discriminant Analysis (LSDA)

    Authors: Zan Gao, Guotai Zhang, Feiping Nie, Hua Zhang

    Abstract: Dimensionality reduction is a crucial step for pattern recognition and data mining tasks to overcome the curse of dimensionality. Principal component analysis (PCA) is a traditional technique for unsupervised dimensionality reduction, which is often employed to seek a projection to best represent the data in a least-squares sense, but if the original data is nonlinear structure, the performance of… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.