Topic Modeling in NLP 공부 커리큘럼 정리

2024. 1. 8. 23:45·Study/Topic Modeling

 

0. Latent Dirichlet Allocation (LDA) (David M Blei , 2003)

 

 

1. Neural Topic Model (NTM) 

 

1-1. ProdLDA (Neural-ProdLDA) (Srivastava and Sutton, 2017)

1-2. Combined TM (Bianchi et al., 2020)

1-3. ZeroshotTM (Bianchi et al., 2021)

 

2. Evaluation Metrics

C_v , Purity 

 

Top-Purity and Normalized Mutual Information(Top-NMI) as metrics(Nguyen et al., 2018)

The KMeans algorithm to topic proportions z and use the clustered documents to report purity(Km-Purity) and NMI Km-NMI (Zhao et al., 2020a)

 

 

2.1 Topic Coherence

 

2.1.1 Normalized Pointwise Mutual Information (NPMI) , (Lau et al., 2014) 

2.1.2 Word Embedding (WE) (Fang et al., 2016)

 

2.2 Topic Diversity

 

2.2.1 Topic Uniqueness (TU) (Dieng et al., 2020 , Topic modeling in embedding spaces )

2.2.2 Inversed Rank-Biased Overlap (I-RBO) (Bianchi et al., 2021 , Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence ) 

 

3. 참고

3.1 C_v , Purity

 

3.2 : Top-Purity and Normalized Mutual Information(Top-NMI) as metrics(Nguyen et al., 2018)

3.3 : The KMeans algorithm to topic proportions z and use the clustered documents to report purity(Km-Purity) and NMI : Km-NMI (Zhao et al., 2020a)

3.4 : Standard RBO (Webber et al., 2010; Terragni et al., 2021b)

 

 

 

 

 

'Study > Topic Modeling' 카테고리의 다른 글

Topic Modeling with Contrastive Learning papers  (0) 2024.04.17
Traditional Topic Model  (0) 2024.02.23
Preliminary for Topic Models  (0) 2024.02.23
Topic Modeling Task 정리하기  (0) 2024.02.21
Automatic Evaluation Metrics for Topic Modeling  (1) 2024.01.10
'Study/Topic Modeling' 카테고리의 다른 글
  • Traditional Topic Model
  • Preliminary for Topic Models
  • Topic Modeling Task 정리하기
  • Automatic Evaluation Metrics for Topic Modeling
Seung-won Seo
Seung-won Seo
ML , NLP , DL 에 관심이 많습니다. 반갑습니다 :P
  • Seung-won Seo
    Butterfly_Effect
    Seung-won Seo
    • 분류 전체보기 (78)
      • 일기장 (2)
      • 메모장 (1)
      • Plan (0)
      • To do List (0)
      • Paper Review (33)
      • Progress Meeting (0)
      • Research in NLP (14)
      • Progress for XTM (0)
      • Writing for XTM (0)
      • 논문작성 Tips (12)
      • Study (16)
        • Algorithm (0)
        • ML & DL (7)
        • NLP (2)
        • Statistics (1)
        • Topic Modeling (6)
  • 링크

  • hELLO· Designed By정상우.v4.10.3
Seung-won Seo
Topic Modeling in NLP 공부 커리큘럼 정리
상단으로

티스토리툴바