Commit Graph

390 Commits

Author SHA1 Message Date
David Huynh
b54f7162a8 Made histogram widget capable of highlighting the selected range.
Added value.log() common numeric facet.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@320 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 22:58:06 +00:00
David Huynh
91241539cf Switched to a canvas-based implementation of histograms.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@319 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 20:45:52 +00:00
Stefano Mazzocchi
b9b4bb0ab4 better dropdown button that doesn't look disabled
git-svn-id: http://google-refine.googlecode.com/svn/trunk@318 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 01:47:03 +00:00
David Huynh
b7338e13f2 Tweaked column header menu dropdown icon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@317 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 00:35:34 +00:00
David Huynh
d56bbc1208 Renamed Protograph Node dialog's title to Schema Skeleton Node.
Made deleting a protograph link update the previews.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@316 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:55:40 +00:00
David Huynh
124960e756 Made "search for match" dialog commit on fb-select event.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@315 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:42:59 +00:00
David Huynh
07cf85b2a5 Added make_cli option for generating a zip containing all files necessary to do 'gridworks run' at the command line. This excludes Java source files but contains pretty much everything else.
Added make_all option that makes dmg, exe, and cli.

Added html and xls exporters.
Made exported files named after project names rather than project IDs.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@314 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:30:16 +00:00
David Huynh
07945f9cde A more helpful error message when the excel importer fails.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@313 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 21:43:58 +00:00
David Huynh
cd062cf028 Minor bug: recon candidate's "id" field should return id, not name.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@312 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 19:54:27 +00:00
David Huynh
b26160dc2b Hopefully a more robust way to get the user data dir on Windows, especially on Windows Vista 64-bit, which jdatapath.dll isn't built for.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@311 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 03:51:58 +00:00
David Huynh
b8519e42d6 Use non-breaking hyphens for "re-match" links.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@309 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 21:01:27 +00:00
David Huynh
999c18cae7 Better date/time format for projects' last modified fields.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@308 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 20:55:01 +00:00
David Huynh
1a8a236cdd Added an error page for when a project create operation fails.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@307 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 18:40:07 +00:00
David Huynh
798805edc5 More styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@306 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 18:11:08 +00:00
David Huynh
4e262f0e1d Styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@305 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 17:38:38 +00:00
Stefano Mazzocchi
ad6e8c2e0c add the ability to browse the values of a particular cluster
git-svn-id: http://google-refine.googlecode.com/svn/trunk@304 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:53:08 +00:00
David Huynh
084a6114d7 Track freebase types of columns added with data from Freebase, so that we can later add more data based on those columns. Fixed minor bug in serialization of data extension records.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@303 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:18:00 +00:00
Stefano Mazzocchi
cf95e5b5f6 freebase branding
git-svn-id: http://google-refine.googlecode.com/svn/trunk@302 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 01:46:32 +00:00
David Huynh
c6e7986206 Extend data operation is working.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@301 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 00:24:20 +00:00
David Huynh
2645c864ab We can now suggest CVT properties.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@300 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 20:41:08 +00:00
David Huynh
c30a5126df More work on the extend data preview dialog: columns can now be removed.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@299 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 19:44:33 +00:00
David Huynh
d0f77a5ef8 Minor layout tweak in clustering dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@298 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 18:48:39 +00:00
Stefano Mazzocchi
7137b4bdf6 make use of multiple cores when doing clustering (has a consistent performance speedup for 5000 rows or more so I enable it by default)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@297 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 07:45:30 +00:00
Stefano Mazzocchi
227b30c860 more optimizations for clustering
git-svn-id: http://google-refine.googlecode.com/svn/trunk@296 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 04:30:49 +00:00
David Huynh
a32273de70 More work on the extend data preview dialog. Results now looks correct, but we still are not handling CVTs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@295 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-14 20:55:57 +00:00
David Huynh
e35c4c3b94 Minor bug.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@294 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-14 05:15:23 +00:00
Stefano Mazzocchi
3495c417cd more fixes to the VPTree, this time it's working consistently for real
git-svn-id: http://google-refine.googlecode.com/svn/trunk@293 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-14 01:29:27 +00:00
Stefano Mazzocchi
f7ab7c9cf6 - incorporated Paolo Ciccarese's fixes for VPTrees in Vicino
- moved all clustering stuff in the vicino package space to simplify external collaboration on that code
- added "type" function to the GEL


git-svn-id: http://google-refine.googlecode.com/svn/trunk@292 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 09:34:17 +00:00
Stefano Mazzocchi
2946f2e8c3 - renamed facet-based-edit-dialog -> clustering-dialog
- added help in case the clustering dialog comes up with no clusters
- changed 'remove' -> (x) button for text facet


git-svn-id: http://google-refine.googlecode.com/svn/trunk@291 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 09:32:06 +00:00
David Huynh
99ae7dea29 More work on the extend data preview dialog. It's starting to render some results.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@290 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 08:08:25 +00:00
David Huynh
f34577ec85 Improved grid layout CSS rules.
Started working on extending data from Freebase feature.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@289 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-13 07:13:18 +00:00
David Huynh
c637df71c9 Factored out grid layout CSS rules.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@288 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 22:42:05 +00:00
David Huynh
67bac099f0 - Made removing facet a bit more interactive: the removed facet disappears right away.
- Made list facets limit themselves to only 2000 choices, so not to overload the browser.
- Made list and range facets handle errors in expressions better.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@287 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 20:08:50 +00:00
Stefano Mazzocchi
00a81c5fc4 make the kNN clustering report the right counts for the facet values (and order them in the clusters by counts)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@286 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 19:10:22 +00:00
Stefano Mazzocchi
d72c07b715 latest clustering fixes (the vptree is still too slow though, I'll probably abandon that approach for now)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@285 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 07:37:37 +00:00
David Huynh
58450555e9 Allow GEL identifiers to contain underscores.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@284 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 06:38:12 +00:00
David Huynh
025eccce4b Implemented "record" field for each row.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@283 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 06:33:03 +00:00
David Huynh
af3cb76056 Added support for including dependent rows in row visiting. Facets still don't count them, though.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@282 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 01:06:23 +00:00
David Huynh
7e2667ab45 Minor bug in Excel importer: we forgot to update the max cell index.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@281 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 00:23:01 +00:00
David Huynh
e760750b57 Fixed minor bug that prevented column details from getting passed on to recon service.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@280 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 21:55:32 +00:00
David Huynh
7526e07e6d TOOL-153: property suggest now allows full property IDs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@279 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 21:19:51 +00:00
David Huynh
4855f70d88 Fixed event wiring bug in index.html that prevented the project name validation from getting run, and prevented limit= and skip= params from getting sent to the server.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@278 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 20:58:42 +00:00
David Huynh
c3ebb5a9f4 Got Vishal's jython integration to work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@277 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 19:56:43 +00:00
Vishal Talwar
4bb9e06772 adding extremely crude jython and clojure expression evaluation
git-svn-id: http://google-refine.googlecode.com/svn/trunk@276 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 23:43:21 +00:00
David Huynh
c81548529b Fixed process widget to scroll properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@275 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 21:18:12 +00:00
David Huynh
86a8e13d88 Added "numeric" choice to numeric range facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@272 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:48:04 +00:00
David Huynh
b1fca11342 Made recon use cells from context rows.
Fixed bug in menu left-right positioning.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@271 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:32:20 +00:00
David Huynh
432e88a23b Fixed range facet to restore non-numeric, blank, and error selections from its ui state properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@270 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:15:01 +00:00
David Huynh
f02fd3f5c3 Try to keep the same scrolling position in the data table view when it gets re-rendered.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@269 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:51:34 +00:00
Stefano Mazzocchi
a7d4951725 several improvements for clustering
- added a unicode ASCII-fiying addition to the fingerprinting functions
 - removed all distance functions for kNN that didn't seem to do anything useful
 - added the ability to indicate what value to use as cluster centroid by simply clicking on it
 (this is useful for those names that have non-ASCII chars that might not even be on your keyboard.. and cut/paste is error prone/cumbersome)
 - added a 10x multiplier to the PPM compression distance which makes it more aligned with the levenshtein ones
 - made sure that we construct a phonetic fingerprint for the whole string and not just the beginning subset
(performance is still not ideal but it's now reasonable)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@268 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:45:14 +00:00
David Huynh
6bf5418f9d Cell changes should also flush column precomputes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@267 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:42:57 +00:00
David Huynh
0160b6841d Fixed data table view bugs: collapsed columns should now stay collapsed even if the column model changes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@266 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:12:09 +00:00
David Huynh
e008332399 - make recon changes flush column precomputes
- fixed bug where recon features are not saved to file properly
- support selecting non-numeric, blank, and error choices in numeric range facets

git-svn-id: http://google-refine.googlecode.com/svn/trunk@265 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 06:02:36 +00:00
Stefano Mazzocchi
72b012971f more polish on the clustering dialog
git-svn-id: http://google-refine.googlecode.com/svn/trunk@264 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 01:18:41 +00:00
David Huynh
1000d63539 In the range facet, although we don't want to update the facets until the user stops dragging, we still want to update the selection indicators from-to during the dragging.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@263 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 22:53:53 +00:00
Stefano Mazzocchi
c03b223a78 fixed another NPE bug
git-svn-id: http://google-refine.googlecode.com/svn/trunk@262 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 22:43:49 +00:00
David Huynh
51b38a4eed Fixed minor bug in binning clusterer.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@261 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 21:28:40 +00:00
Stefano Mazzocchi
0c4b79c53a more polish on the clustering dialog
git-svn-id: http://google-refine.googlecode.com/svn/trunk@260 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 20:58:51 +00:00
David Huynh
6b3a20dc46 Check for null confidence string.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@259 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 20:57:46 +00:00
David Huynh
2bac6844e2 Fixed csv importer to handle escaped quotation marks ("").
git-svn-id: http://google-refine.googlecode.com/svn/trunk@257 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 19:10:55 +00:00
Stefano Mazzocchi
8ce21461cb getting closer to the desired functionality... still way too slow though
git-svn-id: http://google-refine.googlecode.com/svn/trunk@256 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 17:28:50 +00:00
Stefano Mazzocchi
50e58fb863 ngram-blocking gives more expected results... but slow as hell, maybe bug in the vptree code?
git-svn-id: http://google-refine.googlecode.com/svn/trunk@255 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 09:05:20 +00:00
Stefano Mazzocchi
546f87a536 let's try with another knn method
git-svn-id: http://google-refine.googlecode.com/svn/trunk@254 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 08:09:35 +00:00
David Huynh
977dbdb9ed Fixed numeric range facet:
- use X icon for facet remove button
- update facet only when user finishes dragging slider bracket

git-svn-id: http://google-refine.googlecode.com/svn/trunk@250 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 07:01:13 +00:00
David Huynh
311d15f493 Re-organized column header popup menus and added a bunch of common facets and common cell edit transforms.
Added native syntax for regex in GEL and modified replace, split, partition, and rpartition functions to support regex. Removed function replaceRegex.


git-svn-id: http://google-refine.googlecode.com/svn/trunk@249 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 06:57:08 +00:00
Stefano Mazzocchi
5b079b04b7 - moved from float to double to avoid excessive casting from secondstring
- added a few of the more powerful distances
- fixed a bug in the VPTree builder (although is still not working as I expect it to)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@248 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 05:11:36 +00:00
David Huynh
af9e9f590b Fixed minor bug in facets of facet-based edit dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@247 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:41:33 +00:00
David Huynh
4cc1933065 Got facets in facet-based edit dialog to update only after the user has finished dragging the slider bracket.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@246 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:33:07 +00:00
David Huynh
562b9d67a2 Customized brackets of slider widget so that they are asymmetric.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@245 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:14:01 +00:00
David Huynh
a3bcfc1576 Implemented facets on cluster metrics in facet-based edit dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@244 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 23:05:44 +00:00
David Huynh
ff94de5900 Made dialogs draggable.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@243 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 21:50:53 +00:00
David Huynh
8633b20392 Fixed layout bug in expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@242 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 21:02:04 +00:00
David Huynh
6b421c2c75 In property suggest, bubble up properties of included types as well.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@241 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 20:56:04 +00:00
David Huynh
4a4ae6bf27 Fixed toTitlecase to handle parentheses and other delimiters.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@240 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:40:51 +00:00
David Huynh
c147837a3e Cell and facet choice edit popups now allow Shift-Enter as a means to insert new lines. Plain Enter still applies the edit immediately.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@239 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:27:37 +00:00
David Huynh
0ef0aec0c5 Implemented list facet choice edit.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@238 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:07:16 +00:00
David Huynh
d731b89b4c Back to Tahoma as the main font.
Tried more branding colors on dialog header background and tab widget header background.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@237 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 07:21:46 +00:00
David Huynh
6472e1f076 Re-layout when window is resized.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@236 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 06:48:12 +00:00
David Huynh
ac50b3c48b Re-worked the cell editor popup.
Don't keep logging "Saved workspace."

git-svn-id: http://google-refine.googlecode.com/svn/trunk@235 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 06:30:47 +00:00
David Huynh
5d3a57eeeb Implemented project import and export commands (from/to .tar files).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@234 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 02:34:25 +00:00
David Huynh
12d5c6aba5 Fixed layout of extract operation dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@233 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 00:45:29 +00:00
David Huynh
a1ec0ea8df When saving projects, save only modified ones.
Save projects and workspace periodically.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@232 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 00:37:06 +00:00
David Huynh
3388c3e09f Still some old Serializable stuff to remove.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@228 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 23:02:57 +00:00
David Huynh
80e6111a92 Added options for omitting error and blank choices in list facets, and use them in the various recon facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@227 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 22:54:02 +00:00
David Huynh
694f09fb0a Major refactoring: everything is now saved to disk using our own formats, mostly json-based, some inside zip files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@226 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 22:37:26 +00:00
Stefano Mazzocchi
f7b0caa1b8 now kNN clustering is fully operational... not very practical though, needs more work and testing
git-svn-id: http://google-refine.googlecode.com/svn/trunk@225 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 08:27:13 +00:00
David Huynh
e06d8fe130 Better checking for null value in Cell.load.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@224 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 00:35:44 +00:00
David Huynh
e0d72c81e9 Renamed "facet-based edit" operation and command to "mass edit", because it's not just facet-based.
Added option "apply to other cells with same original content" to single cell edit popup, so it can be used like a find&replace operation.
Renamed "do-text-transform" operation and command to just "text-transform".

git-svn-id: http://google-refine.googlecode.com/svn/trunk@223 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 00:25:00 +00:00
David Huynh
b9308e4034 Added option to apply and recluster in the facet based edit dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@222 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:34:27 +00:00
David Huynh
02cd59a5c0 Display cluster sizes and number of cluster in facet-based edit dialog.
Added command to invoke that dialog from column popup menu, so you don't have to create a text facet first to get to it.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@221 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:01:46 +00:00
David Huynh
253874b1a1 Got Clusterer to use Column.name rather than Column.headerLabel now.
Tried using Verdana instead of Tahoma as the common font.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@220 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 22:48:42 +00:00
Stefano Mazzocchi
976c1da5c7 much improved facet clustering dialog and functionality
NOTE: kNN clustering code operational but is not working as expected


git-svn-id: http://google-refine.googlecode.com/svn/trunk@219 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 10:17:58 +00:00
David Huynh
db824bffeb Fixed bug in saving recon changes.
Fixed bug in discard recon judgment operation.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@218 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 08:03:29 +00:00
David Huynh
78b1eb7e73 Major refactoring:
- Made all Change classes save to and load from .zip files.
- Changed Column.headerLabel to Column.name.
- Save project's raw data to "raw-data" file for now. We'll make it save to a zip file next.


git-svn-id: http://google-refine.googlecode.com/svn/trunk@217 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 07:43:45 +00:00
David Huynh
589b9cd936 Re-organized popup menus for row operations. Added filter row.starred.
Disabled rendering of key column and column groups for now.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@216 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 22:31:47 +00:00
David Huynh
fe78fb8e30 A bit of branding and re-laying out the front page.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@215 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 22:09:25 +00:00
David Huynh
5c845f06bf Now we can delete a project even if it hasn't been saved to file yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@214 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:43:08 +00:00
David Huynh
676a189968 Re-organized the .css files to match the .js files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@213 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:36:55 +00:00
David Huynh
7697365180 Added custom suggest for types as well. Styled property and type suggest panes to be wider to accommodate long IDs.
Fixed event wiring bugs in recon dialog.
Made cell edit popup wider and position it relative to the whole cell rather than to the edit link.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@212 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:28:35 +00:00
David Huynh
f1bf446019 Forgot to set white-space:pre on single cell edit popup.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@211 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 08:31:59 +00:00
David Huynh
b3ac945c33 Implemented single-cell editing.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@210 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 08:11:48 +00:00
Stefano Mazzocchi
808fe61c8c remove stuff that doesn't belong in version control
git-svn-id: http://google-refine.googlecode.com/svn/trunk@209 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 07:43:22 +00:00
David Huynh
40cdf5092b Better display of Calendar objects in data table view and in expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@208 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 02:25:27 +00:00
David Huynh
87d20f3299 Fixed minor bug in numeric bin index where if a value was infinity, the bin count went negative.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@207 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 02:10:24 +00:00
Stefano Mazzocchi
37e37488ec ability to delete a project from the front page
git-svn-id: http://google-refine.googlecode.com/svn/trunk@206 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:52:55 +00:00
David Huynh
2227dc8686 Delay injecting facet body into the DOM to speed things up a little.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@205 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:51:01 +00:00
David Huynh
4d893271ca Fixed minor bug in cell UI that prevented single recon judgment operation from updating the UI.
Minor optimization: inject main table into the DOM only after it's completely built.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@204 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:42:21 +00:00
David Huynh
1d6db8fa6e Made recon process cause the client page to create facets when the recon process is done.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@203 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:13:59 +00:00
Stefano Mazzocchi
32c0bf08c9 adding now() and inc() functions to the gel
git-svn-id: http://google-refine.googlecode.com/svn/trunk@202 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 20:53:07 +00:00
David Huynh
9d8b746121 Switched Cell.value from Object to Serializable.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@201 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:59:31 +00:00
David Huynh
3e0ac50e17 Fixed date parsing bug in index.js introduced since last commit.
Removed debugging console.log() call in browsing-engine.js.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@200 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:38:23 +00:00
David Huynh
1dc3d4abbd Save project metadata to disk as JSON now rather than through Java serialization API.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@199 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:15:46 +00:00
Stefano Mazzocchi
409b451085 started work on protocol buffers
git-svn-id: http://google-refine.googlecode.com/svn/trunk@197 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 08:46:33 +00:00
David Huynh
22f226358d Added pre-canned facets isBlank(value) and isError(value).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@196 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:32:30 +00:00
David Huynh
7487e81a4e Fixed bug in expression preview dialog where selecting an old expression from the history pane did not take out the language prefixing, causing the expression to err.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@195 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:18:23 +00:00
David Huynh
6811f54f31 Fixed quoting bug in tripleloader transposer.
Implemented tripleloader exporter.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@194 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:02:03 +00:00
Stefano Mazzocchi
8f01da0aa8 fixing the date parser
git-svn-id: http://google-refine.googlecode.com/svn/trunk@193 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 04:06:38 +00:00
David Huynh
46ba4b94ca Hooked up language select in expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@192 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:45:54 +00:00
Stefano Mazzocchi
1695e2f8f1 add the ngramFingerprint function
git-svn-id: http://google-refine.googlecode.com/svn/trunk@191 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:37:25 +00:00
Stefano Mazzocchi
5c3ca7723a use a TreeSet to do both sorting and de-dupe of the split fragments
git-svn-id: http://google-refine.googlecode.com/svn/trunk@190 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:37:06 +00:00
David Huynh
70df6821a0 Made expression preview dialog for text transform operation also support repeat option.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@189 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:25:42 +00:00
Stefano Mazzocchi
cde6a02cbb typo (needed to escape ' which is actually *not* an HTML entity, who knew)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@188 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:24:06 +00:00
David Huynh
87956be756 Minor bug: don't try to bind null cell value.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@187 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:14:40 +00:00
David Huynh
7eaf4b098c Use .text() don't use .html() to render preview values.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@186 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:04:56 +00:00
David Huynh
9ca456411c Use keyup instead of keypress, because when keypress gets fired, the new value isn't set yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@185 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:50:46 +00:00
David Huynh
1f05954924 Fixed regex text search facet to handle errors better. Use .text() rather than .html() to render cell values, or & will not show up.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@184 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:47:58 +00:00
David Huynh
72d06fe65c Added support for canceling running and pending processes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@183 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:14:48 +00:00
David Huynh
eaef7b2394 Also let user decide what to do on expression evaluation error when creating a new column.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@182 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:32:54 +00:00
Stefano Mazzocchi
c07431fb88 - cataloged all the licenses for the libraries Gridworks depends on
- added the secondstring libraries that contains all sorts of useful string distance functions
- added a java arithmetic coding library (used to implement a string distance based on PPM arithmetic coding)
- added the vicino kNN string clustering library (from MIT's SIMILE)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@181 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:29:52 +00:00
David Huynh
5a0a8bea4f Added custom dialog box for create column operation (with a field for the new column name).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@180 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:12:39 +00:00
David Huynh
2fe8f98e4e Added repeat and repeatCount options for text transform operation. This lets us fix those & repeated encoding problems easily.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@179 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:00:46 +00:00
David Huynh
b4d2cef526 Added an option for what to do when a text transform errors out. Made a custom expression preview dialog for the text transform command in order to suppor that option.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@178 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 22:12:48 +00:00
David Huynh
a3181bbd8f Made protograph UI link widget use the parent node's expected type to configure its property suggest.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@177 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 21:30:46 +00:00
David Huynh
c1498448e4 Implemented global and per-project expression histories and hooked them up to the expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@176 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 21:21:38 +00:00
David Huynh
91b10039ca Started to redesign expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@175 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 18:08:35 +00:00
David Huynh
b75f1faea8 Changed tabs to spaces. No functionality change.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@174 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 04:19:58 +00:00
Stefano Mazzocchi
2691ee50d7 adding OS-specific data paths
git-svn-id: http://google-refine.googlecode.com/svn/trunk@173 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 02:53:07 +00:00
David Huynh
ad7671508f Added "cancel processes" command, not hooked up yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@171 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 00:30:39 +00:00
David Huynh
2741d54984 Added custom suggest widget for properties.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@170 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:48:21 +00:00
David Huynh
59c5314e42 Fixed bug in list facet: list facets on columns with numeric data weren't working before.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@169 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:07:33 +00:00
David Huynh
b488d093c8 Added indicator for facet refreshing, and control for forcing facets to refresh.
Made facets drag-and-drop-able.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@168 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 22:32:52 +00:00
David Huynh
3ecfb4e4d9 Implemented facet-based edit operation for real.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@167 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 20:33:11 +00:00
David Huynh
d9507200f8 Started work on the facet-based edit dialog, which allows editing of cells based on their projections to facet choices.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@166 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 19:58:47 +00:00
David Huynh
512cd16381 Implemented recon by keys, guids, and ids.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@165 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 18:19:20 +00:00
David Huynh
99ae6109d8 Started work on key-based recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@164 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 03:31:58 +00:00
David Huynh
e57aae888b Hooked up the recon service at data.labs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@163 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 00:33:32 +00:00
David Huynh
f16727c20c Refactored recon code on the server side to prepare for supporting other modes of recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@162 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 23:33:23 +00:00
David Huynh
a17882c44f Redesigned the recon dialog. New features are not implemented yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@161 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 21:38:50 +00:00
Stefano Mazzocchi
621655372f - save encoding and confidence in the project metadata
- use the saved encoding for decoding
- don't error when fingerprinting null


git-svn-id: http://google-refine.googlecode.com/svn/trunk@160 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 04:56:16 +00:00
David Huynh
bc9bc54d30 Implemented a meta parser that looks for a language prefix and picks the right parser.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@159 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:50:56 +00:00