Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

Shuyu Chang 0000-0003-0771-2556 School of Computer ScienceNanjing University of Posts and Telecommunications [email protected] Rui Wang 0000-0001-9350-3667 School of Computer ScienceNanjing University of Posts and Telecommunications [email protected] Peng Ren School of Computer ScienceNanjing University of Posts and Telecommunications [email protected]  and  Haiping Huang 0000-0002-4392-3599 School of Computer ScienceNanjing University of Posts and Telecommunications [email protected]
(2024)
Abstract.

Crafting effective topic models for brief texts, like tweets and news headlines, is essential for capturing the swift shifts in social dynamics. Traditional topic models, however, often fall short in accurately representing the semantic intricacies of short texts due to their brevity and lack of contextual data. In our study, we harness the advanced capabilities of Large Language Models (LLMs) to introduce a novel approach termed ”Topic Refinement”. This approach does not directly involve itself in the initial modeling of topics but focuses on improving topics after they have been mined. By employing prompt engineering, we direct LLMs to eliminate off-topic words within a given topic, ensuring that only contextually relevant words are preserved or substituted with ones that fit better semantically. This method emulates human-like scrutiny and improvement of topics, thereby elevating the semantic quality of the topics generated by various models. Our comprehensive evaluation across three unique datasets has shown that our topic refinement approach significantly enhances the semantic coherence of topics.

Topic modeling, Topic refinement, Large language models, Short texts, Prompt engineering
copyright: acmlicensedjournalyear: 2024doi: XXXXXXX.XXXXXXXconference: XX; 2024; XXisbn: 978-1-4503-XXXX-X/18/06submissionid: 9831ccs: Computing methodologies Information extractionccs: Computing methodologies Natural language generation

1. Introduction

Refer to caption
Figure 1. Overview of topic refinement (N=10𝑁10N=10italic_N = 10): ①Prompt Construction; ②LLM Request and Response; ③Iterative Refinement.

Short texts, such as online comments and news headlines, are influential in shaping public opinion and mirroring social trends (Pang et al., 2021; Zhang et al., 2022, 2023). Analyzing these texts can uncover underlying patterns and topics, providing insights into social discourse. Topic modeling emerges as a valuable tool to extract topics from vast data (Lin et al., 2020; Wang et al., 2023). Nevertheless, the sparsity of context in short texts poses substantial challenges for conventional topic models (Blei et al., 2003; Yang et al., 2021). This brevity leads to low information density (Qiang et al., 2022), making it difficult to accurately capture and represent the semantics of discussed topics.

Some advancements in existing methods have been made to alleviate data sparsity. Early studies (Cheng et al., 2014; Li et al., 2016) address this challenge by enriching word co-occurrence within the corpus, thus discovering broader linguistic patterns. Later, researchers begin to incorporate external knowledge sources (Grootendorst, 2022; Meng et al., 2022; Zhang and Lauw, 2022; Churchill et al., 2022), like word embeddings (Wang et al., 2020; Wu et al., 2023; Zuo et al., 2023), into topic modeling to sufficiently capture semantic relationships. Another line of work focuses on generating quantified topic distributions tailored for short texts (Wu et al., 2020, 2022). These methods effectively aggregate short texts with similar topics into the same topic distribution. Recently, contrastive learning has been explored to further mitigate the effects of data sparsity (Nguyen and Luu, 2021; Han et al., 2023; Zhou et al., 2023). These models rely on data augmentation to learn from both similar and dissimilar text pairs, thereby enhancing the learning signals from short texts. Despite these advancements, existing methods frequently yield suboptimal semantic coherence of topics and impair their interpretability in practical applications.

To narrow these gaps, we turn to the booming and widely concerned Large Language Models (LLMs) such as PaLM (Chowdhery et al., 2023), GPT (Brown et al., 2020; Ouyang et al., 2022), and others. With their unprecedented semantic comprehension and generating capabilities (van Dijk et al., 2023; Tan et al., 2023; Ye et al., 2023), trained across extensive datasets, LLMs are expected to further improve the quality of topic modeling for short texts. Capitalizing on this potential, we introduce a novel model-agnostic mechanism: Topic Refinement. More concretely, we focus not on involving in topic mining but rather leverage LLMs via prompt engineering to refine topics initially mined by base topic models. This mechanism systematically iterates over each topic and constructs prompts to request LLMs. It sequentially assumes a word as the potential intruder word in a topic while the remaining words represent that topic. Prompts with LLMs are then employed to identify the topic and assess if the word aligns with the semantic expression of the other words. If alignment is confirmed, the word is retained; otherwise, coherent words are generated as candidates to replace the intruder word. Essentially, topic refinement mimics the human process of identifying, evaluating, and refining the extracted topics to make them more coherent and easier for users to discern.

Overall, the main contributions of our work are as follows:

  • We introduce Topic Refinement, a model-agnostic mechanism for short text topic modeling, pioneering a new pathway to improve topic quality after initial mining.

  • Topic refinement innovatively integrates LLM-based prompt engineering with diverse topic models to precisely refine and optimize the representative words for each topic.

  • Extensive experiments are conducted on three short text datasets to evaluate the effectiveness of our mechanism, particularly in enhancing the semantic coherence of topics.

2. Proposed Method

2.1. Problem Definition

Topic refinement aims to enhance the semantic coherence and clarity of topics generated from base topic models for short texts. Formally, given a set of topics T={t1,t2,,tK}𝑇subscript𝑡1subscript𝑡2subscript𝑡𝐾T=\left\{t_{1},t_{2},\cdots,t_{K}\right\}italic_T = { italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_t start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT } discovered by a base topic model, each topic consists of N𝑁Nitalic_N representative words, denoted by ti={wi1,wi2,wiN}subscript𝑡𝑖subscript𝑤𝑖1subscript𝑤𝑖2subscript𝑤𝑖𝑁t_{i}=\left\{w_{i1},w_{i2},\cdots w_{iN}\right\}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_w start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_i 2 end_POSTSUBSCRIPT , ⋯ italic_w start_POSTSUBSCRIPT italic_i italic_N end_POSTSUBSCRIPT }, where 1iK1𝑖𝐾1\leq i\leq K1 ≤ italic_i ≤ italic_K. Within this context, an intruder word is defined as a word within a topic that is not semantically coherent with the other words, thus disrupting the coherence. The topic refinement mechanism seeks to transform each topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT into a refined topic tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by correcting such intruder words wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT in its word set tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. This transformation is guided by a refinement function \mathcal{R}caligraphic_R, which determines whether to retain or replace the potentially intruder word wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT in tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as follows:

(1) :(ti{wij},wij)wij,:subscript𝑡𝑖subscript𝑤𝑖𝑗subscript𝑤𝑖𝑗subscriptsuperscript𝑤𝑖𝑗\mathcal{R}:\left(t_{i}\setminus\{w_{ij}\},w_{ij}\right)\to w^{\prime}_{ij},caligraphic_R : ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ { italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } , italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) → italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ,

where wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is the alternative word for topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We propose to model the refinement function via LLMs, leveraging their advanced capabilities in understanding and generating.

2.2. Technical Details of Topic Refinement

Table 1. Main results on three datasets. TR refers to the topic refinement mechanism.
Base Models AGNews TagMyNews YahooAnswer
CAsubscript𝐶𝐴C_{A}italic_C start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT CPsubscript𝐶𝑃C_{P}italic_C start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT UCI𝑈𝐶𝐼UCIitalic_U italic_C italic_I NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I CAsubscript𝐶𝐴C_{A}italic_C start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT CPsubscript𝐶𝑃C_{P}italic_C start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT UCI𝑈𝐶𝐼UCIitalic_U italic_C italic_I NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I CAsubscript𝐶𝐴C_{A}italic_C start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT CPsubscript𝐶𝑃C_{P}italic_C start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT UCI𝑈𝐶𝐼UCIitalic_U italic_C italic_I NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I
LDA 0.174 0.220 0.373 0.062 0.036 0.189 0.237 0.391 0.021 0.038 0.190 0.278 0.378 0.510 0.063
  + TR 0.200\uparrow 0.330\uparrow 0.394\uparrow 0.472\uparrow 0.064\uparrow 0.216\uparrow 0.384\uparrow 0.409\uparrow 0.615\uparrow 0.077\uparrow 0.220\uparrow 0.364\uparrow 0.396\uparrow 0.776\uparrow 0.086\uparrow
BTM 0.166 0.153 0.367 -0.034 0.023 0.177 0.142 0.389 -0.594 0.005 0.172 0.185 0.365 0.219 0.036
  + TR 0.205\uparrow 0.349\uparrow 0.395\uparrow 0.702\uparrow 0.074\uparrow 0.216\uparrow 0.357\uparrow 0.409\uparrow 0.445\uparrow 0.066\uparrow 0.204\uparrow 0.308\uparrow 0.389\uparrow 0.527\uparrow 0.065\uparrow
G-BAT 0.181 0.192 0.398 -0.622 0.012 0.216 0.332 0.407 0.302 0.061 0.213 0.265 0.399 0.304 0.062
  + TR 0.204\uparrow 0.337\uparrow 0.403\uparrow 0.215\uparrow 0.056\uparrow 0.237\uparrow 0.414\uparrow 0.424\uparrow 0.719\uparrow 0.087\uparrow 0.235\uparrow 0.386\uparrow 0.415\uparrow 0.730\uparrow 0.090\uparrow
CNTM 0.266 0.469 0.448 0.744 0.101 0.293 0.515 0.475 0.608 0.103 0.294 0.534 0.455 1.311 0.141
  + TR 0.279\uparrow 0.528\uparrow 0.458\uparrow 1.019\uparrow 0.118\uparrow 0.303\uparrow 0.575\uparrow 0.484\uparrow 1.029\uparrow 0.124\uparrow 0.301\uparrow 0.554\uparrow 0.461\uparrow 1.420\uparrow 0.147\uparrow
TSCTM 0.205 0.155 0.437 -1.705 -0.022 0.209 0.153 0.465 -1.911 -0.027 0.215 0.234 0.437 -0.919 0.016
  + TR 0.238\uparrow 0.379\uparrow 0.442\uparrow -0.001\uparrow 0.060\uparrow 0.242\uparrow 0.332\uparrow 0.468\uparrow -0.568\uparrow 0.039\uparrow 0.239\uparrow 0.394\uparrow 0.442\uparrow 0.173\uparrow 0.072\uparrow
BERTopic 0.228 0.295 0.442 -0.458 0.038 0.217 0.287 0.459 -1.090 0.012 0.252 0.421 0.448 0.533 0.094
  + TR 0.257\uparrow 0.433\uparrow 0.451\uparrow 0.401\uparrow 0.083\uparrow 0.235\uparrow 0.416\uparrow 0.469\uparrow -0.257\uparrow 0.054\uparrow 0.258\uparrow 0.457\uparrow 0.451\uparrow 0.730\uparrow 0.105\uparrow
ECRTM 0.203 0.125 0.443 -2.387 -0.053 0.190 0.086 0.451 -2.623 -0.064 0.247 0.315 0.454 -0.829 0.027
  + TR 0.254\uparrow 0.464\uparrow 0.447\uparrow 0.243\uparrow 0.072\uparrow 0.235\uparrow 0.347\uparrow 0.443 -0.509\uparrow 0.036\uparrow 0.292\uparrow 0.496\uparrow 0.470\uparrow 0.590\uparrow 0.099\uparrow

Figure 1 depicts the overall architecture of our topic refinement mechanism. Prior to this mechanism, we extract a set of topics T={t1,t2,,tK}𝑇subscript𝑡1subscript𝑡2subscript𝑡𝐾T=\left\{t_{1},t_{2},\cdots,t_{K}\right\}italic_T = { italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_t start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT } and the vocabulary V𝑉Vitalic_V from short texts through a base topic model. This base model can utilize existing methods in the field, such as probabilistic topic models or neural topic models. Consequently, our mechanism unfolds in three phases: prompt construction, LLM request and response, and iterative refinement.

Refer to caption
Figure 2. LLM prompt template for topic refinement.

Prompt Construction. Prompt engineering serves as the direct way to guide and control the output generated by LLMs. To model the refinement function \mathcal{R}caligraphic_R, our prompt template is designed to guide LLMs to find intruder words in each topic and generate alternative words related to this topic. Given the topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and the words {wi1,wi2,wiN}subscript𝑤𝑖1subscript𝑤𝑖2subscript𝑤𝑖𝑁\left\{w_{i1},w_{i2},\cdots w_{iN}\right\}{ italic_w start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_i 2 end_POSTSUBSCRIPT , ⋯ italic_w start_POSTSUBSCRIPT italic_i italic_N end_POSTSUBSCRIPT } in it, we structure the prompt template into two tasks. Firstly, we employ LLM \mathcal{M}caligraphic_M to identify the common topic shared by the ¡TOPIC_WORDS¿, which is represented by words ti{wij}subscript𝑡𝑖subscript𝑤𝑖𝑗t_{i}\setminus\{w_{ij}\}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ { italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT }. Later, LLM \mathcal{M}caligraphic_M assesses whether the word wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT, denoted as ¡WORD¿, is semantically consistent with the previously determined common topic. In cases of inconsistency, LLM \mathcal{M}caligraphic_M is also tasked with suggesting ten commonly used words closely related to that topic as candidate alternatives. For the clarity and diversity of topics, these candidate words should be easily recognizable and distinctly different from those in the provided list. Moreover, to facilitate subsequent processing of generated results, the prompt template also enforces the format of generated texts to be in JSON. The prompt template is defined as p=Prompt𝑝Promptp=\mathrm{Prompt}italic_p = roman_Prompt(¡TOPIC_WORDS¿, ¡WORD¿), which is used to construct each iteration of prompts p𝑝pitalic_p for LLM \mathcal{M}caligraphic_M. The details of the prompt template are presented in Figure 2.

LLM Request and Response. After constructing the prompt p=Prompt(ti{wij},wij)𝑝Promptsubscript𝑡𝑖subscript𝑤𝑖𝑗subscript𝑤𝑖𝑗p=\mathrm{Prompt}(t_{i}\setminus\{w_{ij}\},w_{ij})italic_p = roman_Prompt ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ { italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } , italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ), we initiate requests to LLM \mathcal{M}caligraphic_M for its response. Essentially, the LLM models the topic refinement function by estimating the conditional probability of the generated text: PLLM(s1,s2,,snp)subscriptPLLMsubscript𝑠1subscript𝑠2conditionalsubscript𝑠𝑛𝑝\mathrm{P_{LLM}}(s_{1},s_{2},\cdots,s_{n}\mid p)roman_P start_POSTSUBSCRIPT roman_LLM end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∣ italic_p ), where (s1,s2,,sn)subscript𝑠1subscript𝑠2subscript𝑠𝑛(s_{1},s_{2},\cdots,s_{n})( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) is the generated word sequence with variable lengths. The generated sequence, formatted in JSON, is then converted into our desired results by a simple mapping function (r1,r2,r3)=f(s1,s2,,sn)subscript𝑟1subscript𝑟2subscript𝑟3𝑓subscript𝑠1subscript𝑠2subscript𝑠𝑛(r_{1},r_{2},r_{3})=f(s_{1},s_{2},\cdots,s_{n})( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) = italic_f ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). r1subscript𝑟1r_{1}italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT represents the topic identified by LLM \mathcal{M}caligraphic_M, typically composed of one word or phrase summarizing this topic. r2subscript𝑟2r_{2}italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is the judgment of whether the ¡WORD¿ in the prompt p𝑝pitalic_p is semantically coherent with the identified topic, with the response being either ”Yes” or ”No”. r3subscript𝑟3r_{3}italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT is a list of ten candidate words for replacement, generated only if r2subscript𝑟2r_{2}italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is ”No”; otherwise, it remains empty. The process above is formalized as follows:

(2) (r1,r2,r3)=(p)=(Prompt(ti{wij},wij)).subscript𝑟1subscript𝑟2subscript𝑟3𝑝Promptsubscript𝑡𝑖subscript𝑤𝑖𝑗subscript𝑤𝑖𝑗(r_{1},r_{2},r_{3})=\mathcal{M}(p)=\mathcal{M}(\mathrm{Prompt}(t_{i}\setminus% \{w_{ij}\},w_{ij})).( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) = caligraphic_M ( italic_p ) = caligraphic_M ( roman_Prompt ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ { italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } , italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) ) .

Iterative Refinement. This phase, informed by the results from LLM \mathcal{M}caligraphic_M, selects a more coherent alternative to replace the intruder word when necessary. If r2subscript𝑟2r_{2}italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT indicates that a word wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT lies outside the semantic boundary of the topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, a list of ten alternative words is generated in r3subscript𝑟3r_{3}italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. From this list, the alternative word wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is selected based on its inclusion in the vocabulary V𝑉Vitalic_V. If none of the alternative words are found in the vocabulary, the mechanism opts for the most semantically similar word from V𝑉Vitalic_V, which is not already included in the topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. This ensures that the alternative word wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is not only semantically coherent but also retains the lexical integrity of the dataset. The process iterates over each word in a topic in reverse order, for words are generally arranged according to topic relevance. By the end of this iteration, we arrive at a refined set of topics tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The specific procedure of the topic refinement mechanism is demonstrated in Algorithm 1.

Algorithm 1 Topic Refinement Mechanism
0:  a set of topics T𝑇Titalic_T, vocabulary V𝑉Vitalic_V, large language model \mathcal{M}caligraphic_M
0:  refined topics T={t1,t2,,tK}superscript𝑇subscriptsuperscript𝑡1subscriptsuperscript𝑡2subscriptsuperscript𝑡𝐾T^{\prime}=\left\{t^{\prime}_{1},t^{\prime}_{2},\cdots,t^{\prime}_{K}\right\}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT }
1:  Initialize the set of refined topics Tsuperscript𝑇T^{\prime}\leftarrow\varnothingitalic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ← ∅;
2:  for each topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in T𝑇Titalic_T do
3:     Initialize the refined topic titisubscriptsuperscript𝑡𝑖subscript𝑡𝑖t^{\prime}_{i}\leftarrow t_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ← italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT;
4:     for each word (wiN,wi(N1),,wi1)subscript𝑤𝑖𝑁subscript𝑤𝑖𝑁1subscript𝑤𝑖1(w_{iN},w_{i(N-1)},\cdots,w_{i1})( italic_w start_POSTSUBSCRIPT italic_i italic_N end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_i ( italic_N - 1 ) end_POSTSUBSCRIPT , ⋯ , italic_w start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT ) in tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
5:        Construct the prompt pPrompt(ti{wij},wij)𝑝Promptsubscript𝑡𝑖subscript𝑤𝑖𝑗subscript𝑤𝑖𝑗p\leftarrow\mathrm{Prompt}(t_{i}\setminus\{w_{ij}\},w_{ij})italic_p ← roman_Prompt ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∖ { italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } , italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT );
6:        Generate the answer (r1,r2,r3)subscript𝑟1subscript𝑟2subscript𝑟3(r_{1},r_{2},r_{3})( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) by Eq. 2;
7:        if the answer r2subscript𝑟2r_{2}italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is ”NO” then
8:           Select the alternative word wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT from r3subscript𝑟3r_{3}italic_r start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT and V𝑉Vitalic_V;
9:           Replace wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT in topic tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT;
10:        else
11:           Retain wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT in topic tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT;
12:        end if
13:     end for
14:  end for

3. Experiments

Refer to caption
Figure 3. Ablation study results with various LLMs. The error bars denote a 95% confidence interval for the statistical variability of results across seven base models.

3.1. Experimental Setup

Datasets. We employ the following short text datasets for evaluation: (1) AGNews (Zhang et al., 2015): more than one million news articles across four categories gathered by ComeToMyHead. We randomly sample 5,000 article titles from each category for experiments. (2) TagMyNews (Vitale et al., 2012): news titles from various news sources, annotated with seven labels. (3) YahooAnswer (Zhang et al., 2015): ten categories of questions and their corresponding best answers from the Yahoo Answers website. Similar to AGNews, we sample 2,500 question titles from each category. Table 2 shows the statistics of these datasets after preprocessing with reference to Wu et al. (2022).

Table 2. Statistics of datasets after preprocessing.
Datasets #Docs #Labels #Vocab. Avg. Length
AGNews 14,845 4 3,290 4.268
TagMyNews 27,369 7 4,325 4.483
YahooAnswer 12,258 10 3,423 4.151

Base Models. We utilize the topic refinement with the following base topic models: (1) LDA (Blei et al., 2003) that uses Gibbs sampling for parameter estimation and inference, with GibbsLDA++111The GibbsLDA++ is available at https://gibbslda.sourceforge.net. implementation in our experiment. (2) BTM (Cheng et al., 2014) that is designed for short texts to enrich word co-occurrence within them222The code of BTM is available at https://github.com/xiaohuiyan/BTM.. (3) Gaussian-BAT (Wang et al., 2020) that incorporates word relatedness information by multivariate Gaussian modeling of topics. (4) CNTM (Nguyen and Luu, 2021) that leverages contrastive learning and word embedding to capture semantic patterns of documents333The code of CNTM is available at https://github.com/nguyentthong/CLNTM.. (5) TSCTM (Wu et al., 2022) that is based on contrastive learning with quantified sampling strategies to improve topic quality in short texts444The code of TSCTM is available at https://github.com/bobxwu/TSCTM.. (6) BERTopic (Grootendorst, 2022) that utilizes pre-trained transformer-based language models to extract topics555The implementation of BERTopic is available at https://maartengr.github.io/BERTopic.. (7) ECRTM (Wu et al., 2023) that forces each topic embedding to be the center of a word embedding cluster in the semantic space via optimal transport666The code of ECRTM is available at https://github.com/BobXWu/ECRTM.. It is noteworthy that we consider both probabilistic topic models, such as LDA and BTM, and advanced variants of neural topic models that integrate external knowledge. All base models are executed with their recommended hyperparameters.

Evaluation Metrics. Following previous mainstream work, we evaluate the topic quality by topic coherence metrics, measuring the coherence between the N𝑁Nitalic_N words of discovered topics. We adopt the widely-used Palmetto777The Palmetto library is available at https://github.com/dice-group/Palmetto. to calculate these coherence metrics CAsubscript𝐶𝐴C_{A}italic_C start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT, CPsubscript𝐶𝑃C_{P}italic_C start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT, CVsubscript𝐶𝑉C_{V}italic_C start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT, UCI𝑈𝐶𝐼UCIitalic_U italic_C italic_I, and NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I (Röder et al., 2015), with higher values indicating better topic quality. This library employs Wikipedia articles as a reference corpus to estimate the co-occurrence probabilities of words.

Implementation details. For our topic refinement mechanism, we use GPT-3.5-turbo as the LLM \mathcal{M}caligraphic_M and set its temperature as 0 to lower the completion randomness. Keeping with prior work, each topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT mined by base topic models is represented by N𝑁Nitalic_N words, where N𝑁Nitalic_N is typically set to 10.

3.2. Main Results

We utilize base topic models with K=20𝐾20K=20italic_K = 20 and 50505050 for experiments in this part. The average results, as depicted in Table 1, reveal the effectiveness of the topic refinement mechanism across various datasets and base models. Notably, incorporating topic refinement consistently enhances the topic coherence of all base models, with increases in nearly every evaluated metric. This improvement is especially pronounced in AGNews and TagMyNews datasets, where base models initially showed lower performance metrics. In effect, topic refinement indirectly mitigates the challenges of data sparsity in short texts by refining topics. Even for base models that initially exhibited high coherence, like CNTM and BERTopic, topic refinement also boosts their performance. Collectively, the uniform advancements prove the model-agnostic nature of our mechanism.

Moreover, Table 3 presents the average token costs for each topic across various datasets, showcasing the efficiency of our mechanism. The input token costs are relatively consistent across datasets due to the similar prompts. Interestingly, the output token costs suggest differences in the complexity of topic modeling for each dataset. For instance, the YahooAnswer dataset, which already exhibited relatively higher-quality topics in the base models, requires fewer tokens for refinement. The implication is that datasets with lower modeling complexity require less intervention from our mechanism.

Table 3. Average token costs for each topic.
Datasets Input tokens Output tokens
AGNews 1589.89 264.21
TagMyNews 1595.61 274.46
YahooAnswer 1579.12 226.27

3.3. Ablation Study and Visualization

We additionally carry out ablation studies to evaluate the impact of different LLMs, such as PaLM2, GPT-3.5, Gemini Pro, and GPT-4, on the TagMyNews dataset with K=20𝐾20K=20italic_K = 20. Figure 3 provides a detailed comparison of coherence metrics and the number of altered words within the topics after topic refinement. The implementation of all tested LLMs clearly improves topic coherence, underscoring the efficacy and robustness of our mechanism. It is noteworthy that LLMs with more recent and superior capabilities exhibit a better performance for refinement, with GPT-4 emerging as the most effective among them. Although Gemini Pro and GPT-4 yield comparable results in coherence metrics, GPT-4 shows fewer alterations in the topics, indicating its precision in refining topics.

Refer to caption
(a) Comparison of NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I for word wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT.
Refer to caption
(b) Comparison of average NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I for topic tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and tisubscriptsuperscript𝑡𝑖t^{\prime}_{i}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.
Figure 4. Coherence visualization.

Figure 4 displays the coherence visualization of ECRTM’s (Wu et al., 2023) topics on the AGNews dataset with K=20𝐾20K=20italic_K = 20. In the visualization, we compute the metric NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I for each word in a topic against the other N1𝑁1N-1italic_N - 1 words to assess word coherence. Figure 4(a) compares the NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I for individual words within the topics before and after refinement, indicated by wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and wijsubscriptsuperscript𝑤𝑖𝑗w^{\prime}_{ij}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT, respectively. The heatmaps show a visible shift towards higher values after refinement to evidence the enhancement in word-topic relatedness. Figure 4(b) further contrasts the average NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I from base topics to refined topics, while the refined topics generally reveal more positive NPMI𝑁𝑃𝑀𝐼NPMIitalic_N italic_P italic_M italic_I values. This improvement is proved by the error bars with 95% confidence interval, which are notably tighter for the refined topics.

4. Conclusion

This paper explores the potential of using LLMs to enhance the topic modeling quality in short texts. Our innovative mechanism, Topic Refinement, utilizes prompt engineering with LLMs to identify and correct semantically intruder words within extracted topics. Extensive experiments across the three datasets and seven base topic models conclusively demonstrate the effectiveness of this mechanism. Future research could involve the dynamic adjustment of the topic modeling process based on LLM feedback, directly tackling the challenge of data sparsity in short texts.

Acknowledgements.
This work was partially supported by the National Natural Science Foundation of China (62102192) and the Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX23_1077).

References

  • (1)
  • Blei et al. (2003) David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research 3 (2003), 993–1022.
  • Brown et al. (2020) Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Proceedings of the 34th Annual Conference on Neural Information Processing Systems, Vol. 33. Curran Associates, Inc., 1877–1901.
  • Cheng et al. (2014) Xueqi Cheng, Xiaohui Yan, Yanyan Lan, and Jiafeng Guo. 2014. BTM: Topic Modeling over Short Texts. IEEE Transactions on Knowledge and Data Engineering 26, 12 (2014), 2928–2941.
  • Chowdhery et al. (2023) Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov, and Noah Fiedel. 2023. PaLM: Scaling Language Modeling with Pathways. Journal of Machine Learning Research 24, 240 (2023), 1–113.
  • Churchill et al. (2022) Robert Churchill, Lisa Singh, Rebecca Ryan, and Pamela Davis-Kean. 2022. A Guided Topic-Noise Model for Short Texts. In Proceedings of the ACM Web Conference 2022. ACM, 2870–2878.
  • Grootendorst (2022) Maarten Grootendorst. 2022. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv:2203.05794
  • Han et al. (2023) Sungwon Han, Mingi Shin, Sungkyu Park, Changwook Jung, and Meeyoung Cha. 2023. Unified Neural Topic Model via Contrastive Learning and Term Weighting. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 1794–1809.
  • Li et al. (2016) Chenliang Li, Haoran Wang, Zhiqian Zhang, Aixin Sun, and Zongyang Ma. 2016. Topic Modeling for Short Texts with Auxiliary Word Embeddings. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 165–174.
  • Lin et al. (2020) Lihui Lin, Hongyu Jiang, and Yanghui Rao. 2020. Copula Guided Neural Topic Modelling for Short Texts. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1773–1776.
  • Meng et al. (2022) Yu Meng, Yunyi Zhang, Jiaxin Huang, Yu Zhang, and Jiawei Han. 2022. Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations. In Proceedings of the ACM Web Conference 2022. ACM, 3143–3152.
  • Nguyen and Luu (2021) Thong Nguyen and Anh Tuan Luu. 2021. Contrastive Learning for Neural Topic Model. In Proceedings of the 35th Annual Conference on Neural Information Processing Systems, Vol. 34. Curran Associates, Inc., 11974–11986.
  • Ouyang et al. (2022) Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, and Ryan Lowe. 2022. Training Language Models to Follow Instructions with Human Feedback. In Proceedings of the 36th Annual Conference on Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 27730–27744.
  • Pang et al. (2021) Jianhui Pang, Yanghui Rao, Haoran Xie, Xizhao Wang, Fu Lee Wang, Tak-Lam Wong, and Qing Li. 2021. Fast Supervised Topic Models for Short Text Emotion Detection. IEEE Transactions on Cybernetics 51, 2 (2021), 815–828.
  • Qiang et al. (2022) Jipeng Qiang, Zhenyu Qian, Yun Li, Yunhao Yuan, and Xindong Wu. 2022. Short Text Topic Modeling Techniques, Applications, and Performance: A Survey. IEEE Transactions on Knowledge and Data Engineering 34, 3 (2022), 1427–1445.
  • Röder et al. (2015) Michael Röder, Andreas Both, and Alexander Hinneburg. 2015. Exploring the Space of Topic Coherence Measures. In Proceedings of the 8th ACM International Conference on Web Search and Data Mining. ACM, 399–408.
  • Tan et al. (2023) Qingyu Tan, Hwee Tou Ng, and Lidong Bing. 2023. Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 14820–14835.
  • van Dijk et al. (2023) Bram van Dijk, Tom Kouwenhoven, Marco Spruit, and Max Johannes van Duijn. 2023. Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 12641–12654.
  • Vitale et al. (2012) Daniele Vitale, Paolo Ferragina, and Ugo Scaiella. 2012. Classification of Short Texts by Deploying Topical Annotations. In Proceedings of the 34th European Conference on Information Retrieval, Vol. 7224. Springer, 376–387.
  • Wang et al. (2023) Qingren Wang, Chengcheng Zhu, Yiwen Zhang, Hong Zhong, Jinqin Zhong, and Victor S. Sheng. 2023. Short Text Topic Learning Using Heterogeneous Information Network. IEEE Transactions on Knowledge and Data Engineering 35, 5 (2023), 5269–5281.
  • Wang et al. (2020) Rui Wang, Xuemeng Hu, Deyu Zhou, Yulan He, Yuxuan Xiong, Chenchen Ye, and Haiyang Xu. 2020. Neural Topic Modeling with Bidirectional Adversarial Training. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 340–350.
  • Wu et al. (2023) Xiaobao Wu, Xinshuai Dong, Thong Thanh Nguyen, and Anh Tuan Luu. 2023. Effective Neural Topic Modeling with Embedding Clustering Regularization. In Proceedings of the 40th International Conference on Machine Learning, Vol. 202. PMLR, 37335–37357.
  • Wu et al. (2020) Xiaobao Wu, Chunping Li, Yan Zhu, and Yishu Miao. 2020. Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1772–1782.
  • Wu et al. (2022) Xiaobao Wu, Anh Tuan Luu, and Xinshuai Dong. 2022. Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2748–2760.
  • Yang et al. (2021) Tianchi Yang, Linmei Hu, Chuan Shi, Houye Ji, Xiaoli Li, and Liqiang Nie. 2021. HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification. ACM Transactions on Information Systems 39, 3 (2021), 32:1–32:29.
  • Ye et al. (2023) Yunhu Ye, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, and Yongbin Li. 2023. Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 174–184.
  • Zhang and Lauw (2022) Delvin Ce Zhang and Hady W. Lauw. 2022. Meta-Complementing the Semantics of Short Texts in Neural Topic Models. In Proceedings of the 36th Annual Conference on Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 29498–29511.
  • Zhang et al. (2023) Huakui Zhang, Yi Cai, Haopeng Ren, and Qing Li. 2023. Multimodal Topic Modeling by Exploring Characteristics of Short Text Social Media. IEEE Transactions on Multimedia 25 (2023), 2430–2445.
  • Zhang et al. (2022) Wei Zhang, Chao Dong, Jianhua Yin, and Jianyong Wang. 2022. Attentive Representation Learning With Adversarial Training for Short Text Clustering. IEEE Transactions on Knowledge and Data Engineering 34, 11 (2022), 5196–5210.
  • Zhang et al. (2015) Xiang Zhang, Junbo Jake Zhao, and Yann LeCun. 2015. Character-level Convolutional Networks for Text Classification. In Proceedings of the 29th Annual Conference on Neural Information Processing Systems, Vol. 28. Curran Associates, Inc., 649–657.
  • Zhou et al. (2023) Xixi Zhou, Jiajun Bu, Sheng Zhou, Zhi Yu, Ji Zhao, and Xifeng Yan. 2023. Improving topic disentanglement via contrastive learning. Information Processing & Management 60, 2 (2023), 103164.
  • Zuo et al. (2023) Yuan Zuo, Congrui Li, Hao Lin, and Junjie Wu. 2023. Topic Modeling of Short Texts: A Pseudo-Document View With Word Embedding Enhancement. IEEE Transactions on Knowledge and Data Engineering 35, 1 (2023), 972–985.