spark-scala-KMeans

Here is a basic KMeans algorithm in Spark/Scala. It uses plain arrays and makes no use of MLlib. If you are in a hurry and want something basic to build on, this can be a good starting point. The input is a tab-delimited text file in which each row represents a point. Set numClusters (the number of clusters you'd like to have) and numIterations before using it. At the end, the code prints the centers and the center to which each point is mapped. Please let me know if you have any questions ([email protected]).
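To make the steps above concrete, here is a minimal sketch of the same idea in plain Scala, without Spark. It is not the code from basicKMeans.txt: the object name `BasicKMeansSketch`, the helper names, and the choice of the first numClusters points as initial centers are all assumptions made for illustration. In a Spark version, the assignment step would typically run inside a `map` over an RDD of points and the per-cluster sums would be combined with `reduceByKey`.

```scala
object BasicKMeansSketch {
  type Point = Array[Double]

  // Parse one tab-delimited row of the input file into a point.
  def parsePoint(line: String): Point =
    line.trim.split("\t").map(_.toDouble)

  // Squared Euclidean distance between two points of equal dimension.
  def squaredDistance(a: Point, b: Point): Double =
    a.zip(b).map { case (x, y) => (x - y) * (x - y) }.sum

  // Index of the center closest to p.
  def closestCenter(p: Point, centers: Array[Point]): Int =
    centers.indices.minBy(i => squaredDistance(p, centers(i)))

  // One iteration: assign each point to its closest center, then move
  // each center to the mean of the points assigned to it.
  def updateCenters(points: Array[Point], centers: Array[Point]): Array[Point] = {
    val grouped = points.groupBy(p => closestCenter(p, centers))
    centers.indices.map { i =>
      grouped.get(i) match {
        case Some(members) =>
          val sum = members.reduce((a, b) => a.zip(b).map { case (x, y) => x + y })
          sum.map(_ / members.length)
        case None => centers(i) // an empty cluster keeps its old center
      }
    }.toArray
  }

  // Assumption: the first numClusters points serve as the initial centers.
  def run(points: Array[Point], numClusters: Int, numIterations: Int): (Array[Point], Array[Int]) = {
    var centers = points.take(numClusters).map(_.clone)
    for (_ <- 0 until numIterations) centers = updateCenters(points, centers)
    (centers, points.map(p => closestCenter(p, centers)))
  }

  def main(args: Array[String]): Unit = {
    // Hypothetical input rows; a real run would read them from the text file.
    val rows = Array("1.0\t1.0", "1.0\t2.0", "9.0\t9.0", "10.0\t9.0")
    val (centers, assignment) = run(rows.map(parsePoint), numClusters = 2, numIterations = 5)
    centers.foreach(c => println(c.mkString("center: ", "\t", "")))
    assignment.zipWithIndex.foreach { case (c, i) => println(s"point $i -> center $c") }
  }
}
```

Running a fixed number of iterations, as the description above suggests, keeps the loop simple; a convergence check on how far the centers move would be a natural extension.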

Files in this repository: README.md and basicKMeans.txt.

kardes/spark-scala-KMeans
