Skip to content
 

Data cleaning tool!

Hal Varian writes:

You might find this a useful tool for cleaning data.

I haven’t tried it out yet, but data cleaning is a hugely important topic and so this could be a big deal.

3 Comments

  1. TheOneEyedMan says:

    Just installed it. Looks like a fantastic tool.

  2. Tom Hopper says:

    I've tried it out, once on real data, and it's brilliant.

  3. Joel says:

    I've been using it at work this week on a set of about 120,000 rows over 25 variables. It works great, although you might have to turn up the limit on the number of facets for big sets. It runs on java, can use an occasional refresh, and the clustering tools and expression language are excellent. Plus, unlimited undo/redo helps with experimenting