
Algorithm for high dimensional data clustering

By Ahmed | Aug 18, 2016 04:39PM CEST

Dear Experts

I'm currently using XLSTAT for my data analysis and clustering.

I have analyzed the log file of one website and built a session-page view matrix in an Excel file. The file consists of thousands of sessions (rows) and 128 columns, each representing the number of occurrences of a particular page in a session.
I would like to know which algorithm is best suited for clustering similar sessions, and I would be grateful for any tutorial or example.

The following line represents one row of my matrix:
s1 5 15 0 0 0 129 3...etc

Thanks


By Thierry | Aug 18, 2016 08:46PM CEST | XLSTAT Agent

Hello,

You might want to start with k-means. It is fast and performs well.

Best regards,

Thierry
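
For readers who want to see what k-means does to a session-page matrix like the one described in the question, here is a minimal sketch in plain Python. It is not XLSTAT's implementation; the function names (`kmeans`, `squared_distance`), the toy 4-column matrix, and the choice of k = 2 are all assumptions made for illustration. The real matrix in the question has 128 columns and thousands of rows.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length rows."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(rows, k, max_iter=100, seed=0):
    """Basic k-means (Lloyd's algorithm): returns (labels, centroids)."""
    rng = random.Random(seed)
    # Start from k distinct rows picked at random as initial centroids.
    centroids = [list(r) for r in rng.sample(rows, k)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each session goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(row, centroids[c]))
            for row in rows
        ]
        if new_labels == labels:
            break  # assignments no longer change, so we have converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical toy data: 4 sessions x 4 pages, values are page-view counts.
sessions = [
    [5, 15, 0, 0],
    [4, 12, 0, 1],
    [0, 0, 9, 7],
    [0, 1, 8, 6],
]
labels, centroids = kmeans(sessions, k=2)
print(labels)  # sessions 0 and 1 share one label, sessions 2 and 3 the other
```

Note that k-means needs the number of clusters k to be chosen in advance, and that it relies on Euclidean distance, so pages with very large counts weigh more heavily in the result.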
