Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S1_4_and_S5.zip数据含义 #5

Open
qingaidexin opened this issue Sep 2, 2017 · 1 comment
Open

S1_4_and_S5.zip数据含义 #5

qingaidexin opened this issue Sep 2, 2017 · 1 comment

Comments

@qingaidexin
Copy link

qingaidexin commented Sep 2, 2017

你好,我想问下 S1_4_and_S5.zip 里面数据的含义
0 1:0.000000 2:0.000000 3:0.000000 4:0.000000 5:0.000000 6:0.000000 7:0.000000 8:0.000000 9:0.000000 10:0.000000 11:0.000000 12:0.000000 13:0.000000 14:0.000000 15:0.000000 16:0.001348 17:0.000000 18:0.222222 19:0.000000 20:0.001282 21:0.000000 22:0.000000 23:0.000000 24:0.000000 25:0.000000 26:0.000000 27:0.000000 28:0.000000 29:0.000000 30:0.000000 31:0.000000 32:0.000000 33:0.000000 34:0.000000 35:0.000000 36:0.000000 37:0.000000 38:0.000000 39:0.000000 40:0.000000 41:0.000000 42:0.000000 43:0.017241 44:0.000000 45:0.000000 46:0.000000 #10,GX000-00-0000000

这样一条数据,代表的是什么意思呢? 第一个0 是是否点击? 那后面的这些特征代表的含义是什么啊,还有为什么后面的数是小数呢,代表的含义是什么呢? 如果以ml-1m电影数据为例,期待您的回答

@ghost
Copy link

ghost commented Mar 29, 2018

这是一种特殊的数据格式。倘若你想要使用自己的数据。你需要先用把自己原始的数据转化为svmlight格式。在sklearn里面有一个函数具有这样的功能sklearn.datasets.dump_svmlight_file

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant