A question about the data set

« Prev
Topic
» Next
Topic
suvor's image Posts 2
Joined 8 Apr '12 Email user

Hi, everybody! Help me, please.. I'm not sure I got the right data.. (HHP_release3.zip)

- At opening a data file (Members.csv for example) Excel write "The file is loaded not completely" !!!?

-Each data file contains equally 65535 records, but some MembersID are repeat, and others are absent !! Then, why the number of records is equally?

 So, I'm afraid I got the data not completely.. Thanks..

 
Signipinnis's image Posts 94
Thanks 25
Joined 8 Apr '11 Email user

I believe that's the max-row limit for Microsoft Excel-2007 and before. So it's importing that much data, and then stopping. You can get a good sense of the data by looking at that portion of it in Excel-2007, but it would probably be a better plan to do most of your actual data munging and modeling etc with some other software tool/s, given the size of the data in this contest.

Excel-2010 will handle much more data than the older versions. That alone doesn't necessarily make it the best tool for this job, unless you are very, very good with Excel, and determined.

 
suvor's image Posts 2
Joined 8 Apr '12 Email user

What a stupid "Excel" ! I didn't like it always.. Tnanks!

 
Metatron Associates's image Posts 1
Joined 16 Apr '12 Email user

For Excel 2010, the number of rows and columns increased to 1,048,576

But what are the actual sizes of the data sets provided (i.e. columns v. rows)?

 
Signipinnis's image Posts 94
Thanks 25
Joined 8 Apr '11 Email user

The Claims dataset is the longest, it's a little under 2.7 million rows.

 
ADP's image
ADP
Posts 12
Thanks 1
Joined 21 Aug '11 Email user

suvor wrote:

What a stupid "Excel" ! I didn't like it always.. Tnanks!

What a classic comment. Give that man a beer.

 

Reply

Flag alert Flagging is a way of notifying administrators that this message contents inappropriate or abusive content. Are you sure this forum post qualifies?