Extracting Lexical Semantics from Text

The following are a collection of papers relevant to extracting lexical semantics (syntactics and association) from text corpora.

  1. The Vector Space Model (Salton et al., 1975)
  2. Latent Semantic Analysis (Deerwester et al., 1991; Landauer & Dumais, 1997)
  3. Hyperspace Analog to Language (Lund & Burgess, 1996)
  4. Semi Discrete Matrix Decomposition (Kolda & O'Leary, 1998)
  5. The Syntagmatic Paradigmatic Model (Dennis, 2003)
  6. Pooled Adjacent Context model (Redington, Chater, & Finch, 1998)
  7. Probabilistic Latent Semantic Indexing (Hofmann, 2001)
  8. Latent Dirichlet Allocation (Blei, Ng, & Jordan, 2002)
  9. The topics model (Griffiths & Steyvers, 2002)
  10. Word Association Space (Steyvers, Shiffrin & Nelson, 2000)
  11. Non-negative matrix factorization (Lee & Seung, 1999)(Ge & Iwata, 2002)
  12. Local linear embedding (Roweis & Saul, 2000)
  13. Independent Components Analysis (Isbell & Viola 1998)
  14. Information Bottleneck (Slonim & Tishby 2000)
  15. Local LSI (Schutze, Hull & Pedersen 1995)
  16. UNICON (Lin & Pantel 2001)
Some other useful material.