Stefano Mazzocchi
16a2600a49
now it's jslint time to be happier: (!= null) -> (!== null)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@432 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:00:44 +00:00
Will Moffat
fff6eb0aa2
JSLint: legitimate comparisons to 0 and null should be made using ===
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@425 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:18:38 +00:00
Will Moffat
41a52f135d
JSLint: foo == 0, foo == null --> !foo
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@424 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:16:08 +00:00
Will Moffat
94aa2e0168
JSLint: semicolons
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@423 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 19:52:23 +00:00
Will Moffat
fbc1d04efb
JSLint fixes
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@421 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 18:21:28 +00:00
David Huynh
f402db10af
Implemented inter-project joins.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@387 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 05:35:48 +00:00
Stefano Mazzocchi
5884d257db
default to 'relevance' instead of recon (which is faster). Change to recon if the user suggests schema hooks.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@363 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:17 +00:00
David Huynh
b54f7162a8
Made histogram widget capable of highlighting the selected range.
...
Added value.log() common numeric facet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@320 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 22:58:06 +00:00
David Huynh
91241539cf
Switched to a canvas-based implementation of histograms.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@319 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 20:45:52 +00:00
David Huynh
798805edc5
More styling tweaks.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@306 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 18:11:08 +00:00
Stefano Mazzocchi
ad6e8c2e0c
add the ability to browse the values of a particular cluster
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@304 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:53:08 +00:00
David Huynh
084a6114d7
Track freebase types of columns added with data from Freebase, so that we can later add more data based on those columns. Fixed minor bug in serialization of data extension records.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@303 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:18:00 +00:00
David Huynh
c6e7986206
Extend data operation is working.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@301 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 00:24:20 +00:00
David Huynh
2645c864ab
We can now suggest CVT properties.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@300 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 20:41:08 +00:00
David Huynh
c30a5126df
More work on the extend data preview dialog: columns can now be removed.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@299 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 19:44:33 +00:00
David Huynh
d0f77a5ef8
Minor layout tweak in clustering dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@298 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 18:48:39 +00:00
David Huynh
a32273de70
More work on the extend data preview dialog. Results now look correct, but we still are not handling CVTs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@295 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-14 20:55:57 +00:00
Stefano Mazzocchi
2946f2e8c3
- renamed facet-based-edit-dialog -> clustering-dialog
...
- added help in case the clustering dialog comes up with no clusters
- changed 'remove' -> (x) button for text facet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@291 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 09:32:06 +00:00
David Huynh
99ae7dea29
More work on the extend data preview dialog. It's starting to render some results.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@290 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 08:08:25 +00:00
David Huynh
f34577ec85
Improved grid layout CSS rules.
...
Started working on extending data from Freebase feature.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@289 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 07:13:18 +00:00
David Huynh
c637df71c9
Factored out grid layout CSS rules.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@288 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 22:42:05 +00:00
Stefano Mazzocchi
a7d4951725
several improvements for clustering
...
- added a Unicode ASCII-fying addition to the fingerprinting functions
- removed all distance functions for kNN that didn't seem to do anything useful
- added the ability to indicate what value to use as cluster centroid by simply clicking on it
(this is useful for those names that have non-ASCII chars that might not even be on your keyboard.. and cut/paste is error prone/cumbersome)
- added a 10x multiplier to the PPM compression distance which makes it more aligned with the Levenshtein ones
- made sure that we construct a phonetic fingerprint for the whole string and not just the beginning subset
(performance is still not ideal but it's now reasonable)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@268 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:45:14 +00:00
Stefano Mazzocchi
72b012971f
more polish on the clustering dialog
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@264 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 01:18:41 +00:00
Stefano Mazzocchi
0c4b79c53a
more polish on the clustering dialog
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@260 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 20:58:51 +00:00
Stefano Mazzocchi
8ce21461cb
getting closer to the desired functionality... still way too slow though
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@256 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 17:28:50 +00:00
David Huynh
311d15f493
Re-organized column header popup menus and added a bunch of common facets and common cell edit transforms.
...
Added native syntax for regex in GEL and modified replace, split, partition, and rpartition functions to support regex. Removed function replaceRegex.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@249 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 06:57:08 +00:00
Stefano Mazzocchi
5b079b04b7
- moved from float to double to avoid excessive casting from secondstring
...
- added a few of the more powerful distances
- fixed a bug in the VPTree builder (although it is still not working as I expect it to)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@248 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 05:11:36 +00:00
David Huynh
af9e9f590b
Fixed minor bug in facets of facet-based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@247 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:41:33 +00:00
David Huynh
4cc1933065
Got facets in facet-based edit dialog to update only after the user has finished dragging the slider bracket.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@246 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:33:07 +00:00
David Huynh
a3bcfc1576
Implemented facets on cluster metrics in facet-based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@244 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 23:05:44 +00:00
David Huynh
8633b20392
Fixed layout bug in expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@242 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 21:02:04 +00:00
Stefano Mazzocchi
f7b0caa1b8
now kNN clustering is fully operational... not very practical though, needs more work and testing
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@225 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 08:27:13 +00:00
David Huynh
e0d72c81e9
Renamed "facet-based edit" operation and command to "mass edit", because it's not just facet-based.
...
Added option "apply to other cells with same original content" to single cell edit popup, so it can be used like a find&replace operation.
Renamed "do-text-transform" operation and command to just "text-transform".
git-svn-id: http://google-refine.googlecode.com/svn/trunk@223 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 00:25:00 +00:00
David Huynh
b9308e4034
Added option to apply and recluster in the facet based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@222 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:34:27 +00:00
David Huynh
02cd59a5c0
Display cluster sizes and number of clusters in facet-based edit dialog.
...
Added command to invoke that dialog from column popup menu, so you don't have to create a text facet first to get to it.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@221 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:01:46 +00:00
Stefano Mazzocchi
976c1da5c7
much improved facet clustering dialog and functionality
...
NOTE: kNN clustering code operational but is not working as expected
git-svn-id: http://google-refine.googlecode.com/svn/trunk@219 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 10:17:58 +00:00
David Huynh
78b1eb7e73
Major refactoring:
...
- Made all Change classes save to and load from .zip files.
- Changed Column.headerLabel to Column.name.
- Save project's raw data to "raw-data" file for now. We'll make it save to a zip file next.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@217 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 07:43:45 +00:00
David Huynh
7697365180
Added custom suggest for types as well. Styled property and type suggest panes to be wider to accommodate long IDs.
...
Fixed event wiring bugs in recon dialog.
Made cell edit popup wider and position it relative to the whole cell rather than to the edit link.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@212 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:28:35 +00:00
David Huynh
7487e81a4e
Fixed bug in expression preview dialog where selecting an old expression from the history pane did not take out the language prefixing, causing the expression to err.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@195 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:18:23 +00:00
David Huynh
46ba4b94ca
Hooked up language select in expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@192 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:45:54 +00:00
David Huynh
70df6821a0
Made expression preview dialog for text transform operation also support repeat option.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@189 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:25:42 +00:00
David Huynh
7eaf4b098c
Use .text(), not .html(), to render preview values.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@186 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:04:56 +00:00
David Huynh
2fe8f98e4e
Added repeat and repeatCount options for text transform operation. This lets us fix those & repeated encoding problems easily.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@179 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:00:46 +00:00
David Huynh
c1498448e4
Implemented global and per-project expression histories and hooked them up to the expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@176 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 21:21:38 +00:00
David Huynh
91b10039ca
Started to redesign expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@175 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 18:08:35 +00:00
David Huynh
2741d54984
Added custom suggest widget for properties.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@170 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:48:21 +00:00
David Huynh
3ecfb4e4d9
Implemented facet-based edit operation for real.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@167 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 20:33:11 +00:00
David Huynh
d9507200f8
Started work on the facet-based edit dialog, which allows editing of cells based on their projections to facet choices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@166 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 19:58:47 +00:00
David Huynh
512cd16381
Implemented recon by keys, guids, and ids.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@165 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 18:19:20 +00:00
David Huynh
e57aae888b
Hooked up the recon service at data.labs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@163 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 00:33:32 +00:00