Distributional semantics: machine-learning framework for clustering

Price negotiable • cashless payment, electronic money
10 May 2019, 00:05 • 6 responses • 40 views
Implement a program that clusters a given list of target words into n groups based on their distributional co-occurrence patterns. Construct a word-by-word matrix that captures the co-occurrence patterns of the target words in a given corpus. Keep the definitions of context and features flexible so that the results can be analysed under different settings. The program should take as input a list of words to cluster and the number of clusters. Use any available machine-learning framework for clustering (e.g. scikit-learn (http://scikit-learn.org/) or Weka (http://www.cs.waikato.ac.nz/ml/weka/)).
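
One possible starting point, as a minimal sketch rather than a finished solution. It assumes the corpus is already tokenised into sentences, treats a symmetric window of tokens around each target as the context, uses raw co-occurrence counts as features, and clusters the L2-normalised rows with k-means from scikit-learn. The names `cooccurrence_matrix`, `cluster_words`, `window` and `seed` are illustrative and not part of the task description.

```python
from collections import Counter

import numpy as np
from sklearn.cluster import KMeans


def cooccurrence_matrix(sentences, targets, window=2):
    """Build a target-by-context count matrix.

    sentences: iterable of token lists (assumed pre-tokenised corpus).
    targets:   list of words to describe.
    window:    number of tokens on each side counted as context (assumption).
    """
    target_index = {word: i for i, word in enumerate(targets)}
    counts = [Counter() for _ in targets]
    for tokens in sentences:
        for pos, token in enumerate(tokens):
            if token not in target_index:
                continue
            lo = max(0, pos - window)
            hi = min(len(tokens), pos + window + 1)
            for ctx_pos in range(lo, hi):
                if ctx_pos != pos:
                    counts[target_index[token]][tokens[ctx_pos]] += 1

    # Features are all context words seen near any target, in sorted order.
    features = sorted(set().union(*counts))
    feature_index = {word: j for j, word in enumerate(features)}
    matrix = np.zeros((len(targets), len(features)))
    for i, counter in enumerate(counts):
        for ctx, n in counter.items():
            matrix[i, feature_index[ctx]] = n
    return matrix, features


def cluster_words(sentences, targets, n_clusters, window=2, seed=0):
    """Return a dict mapping each target word to a cluster label."""
    matrix, _ = cooccurrence_matrix(sentences, targets, window)
    # L2-normalise rows so that frequent words do not dominate the distances;
    # rows with no observed context are left as all zeros.
    norms = np.linalg.norm(matrix, axis=1, keepdims=True)
    matrix = matrix / np.where(norms == 0, 1.0, norms)
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = model.fit_predict(matrix)
    return dict(zip(targets, labels.tolist()))


if __name__ == "__main__":
    # Toy corpus, invented purely for illustration.
    corpus = [
        "a cat eats food".split(),
        "a dog eats food".split(),
        "my car needs fuel".split(),
        "my bus needs fuel".split(),
    ]
    print(cluster_words(corpus, ["cat", "dog", "car", "bus"], n_clusters=2))
```

Changing `window`, or replacing the raw counts with a weighting such as PPMI, is one way to vary the context and feature definitions and compare the resulting clusters. The cluster labels themselves are arbitrary integers, so only the grouping is meaningful.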