Extracting Lexical Semantics from Text
The following are a collection of papers relevant to extracting lexical semantics (syntactics and association) from text corpora.
- The Vector Space Model (Salton et al., 1975)
- Latent Semantic Analysis (Deerwester et al., 1991; Landauer & Dumais,
1997)
- Hyperspace Analog to Language (Lund & Burgess, 1996)
- Semi Discrete Matrix Decomposition (Kolda & O'Leary, 1998)
- The Syntagmatic Paradigmatic Model
(Dennis, 2003)
- Pooled Adjacent Context model (Redington, Chater, & Finch, 1998)
- Probabilistic Latent Semantic Indexing (Hofmann, 2001)
- Latent Dirichlet Allocation (Blei, Ng, &
Jordan, 2002)
- The topics model (Griffiths & Steyvers, 2002)
- Word Association Space (Steyvers,
Shiffrin & Nelson, 2000)
- Non-negative matrix factorization (Lee & Seung, 1999)(Ge & Iwata, 2002)
- Local linear embedding (Roweis & Saul,
2000)
- Independent Components Analysis (Isbell
& Viola 1998)
- Information Bottleneck (Slonim & Tishby
2000)
- Local LSI (Schutze, Hull & Pedersen
1995)
- UNICON (Lin & Pantel 2001)
Some other useful material.