David Huynh
af3cb76056
Added support for including dependent rows in row visiting. Facets still don't count them, though.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@282 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 01:06:23 +00:00
David Huynh
7e2667ab45
Minor bug in Excel importer: we forgot to update the max cell index.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@281 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 00:23:01 +00:00
David Huynh
e760750b57
Fixed minor bug that prevented column details from getting passed on to recon service.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@280 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 21:55:32 +00:00
David Huynh
7526e07e6d
TOOL-153: property suggest now allows full property IDs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@279 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 21:19:51 +00:00
David Huynh
4855f70d88
Fixed event wiring bug in index.html that prevented the project name validation from getting run, and prevented limit= and skip= params from getting sent to the server.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@278 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 20:58:42 +00:00
David Huynh
c3ebb5a9f4
Got Vishal's jython integration to work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@277 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 19:56:43 +00:00
Vishal Talwar
4bb9e06772
adding extremely crude jython and clojure expression evaluation
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@276 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 23:43:21 +00:00
David Huynh
c81548529b
Fixed process widget to scroll properly.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@275 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 21:18:12 +00:00
David Huynh
2094ded82c
Updated site index.html to link to 1.0b files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@273 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:56:38 +00:00
David Huynh
86a8e13d88
Added "numeric" choice to numeric range facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@272 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:48:04 +00:00
David Huynh
b1fca11342
Made recon use cells from context rows.
...
Fixed bug in menu left-right positioning.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@271 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:32:20 +00:00
David Huynh
432e88a23b
Fixed range facet to restore non-numeric, blank, and error selections from its ui state properly.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@270 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:15:01 +00:00
David Huynh
f02fd3f5c3
Try to keep the same scrolling position in the data table view when it gets re-rendered.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@269 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:51:34 +00:00
Stefano Mazzocchi
a7d4951725
several improvements for clustering
...
- added a unicode ASCII-fiying addition to the fingerprinting functions
- removed all distance functions for kNN that didn't seem to do anything useful
- added the ability to indicate what value to use as cluster centroid by simply clicking on it
(this is useful for those names that have non-ASCII chars that might not even be on your keyboard.. and cut/paste is error prone/cumbersome)
- added a 10x multiplier to the PPM compression distance which makes it more aligned with the levenshtein ones
- made sure that we construct a phonetic fingerprint for the whole string and not just the beginning subset
(performance is still not ideal but it's now reasonable)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@268 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:45:14 +00:00
David Huynh
6bf5418f9d
Cell changes should also flush column precomputes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@267 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:42:57 +00:00
David Huynh
0160b6841d
Fixed data table view bugs: collapsed columns should now stay collapsed even if the column model changes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@266 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:12:09 +00:00
David Huynh
e008332399
- make recon changes flush column precomputes
...
- fixed bug where recon features are not saved to file properly
- support selecting non-numeric, blank, and error choices in numeric range facets
git-svn-id: http://google-refine.googlecode.com/svn/trunk@265 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 06:02:36 +00:00
Stefano Mazzocchi
72b012971f
more polish on the clustering dialog
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@264 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 01:18:41 +00:00
David Huynh
1000d63539
In the range facet, although we don't want to update the facets until the user stops dragging, we still want to update the selection indicators from-to during the dragging.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@263 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 22:53:53 +00:00
Stefano Mazzocchi
c03b223a78
fixed another NPE bug
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@262 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 22:43:49 +00:00
David Huynh
51b38a4eed
Fixed minor bug in binning clusterer.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@261 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 21:28:40 +00:00
Stefano Mazzocchi
0c4b79c53a
more polish on the clustering dialog
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@260 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 20:58:51 +00:00
David Huynh
6b3a20dc46
Check for null confidence string.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@259 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 20:57:46 +00:00
David Huynh
2bac6844e2
Fixed csv importer to handle escaped quotation marks ("").
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@257 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 19:10:55 +00:00
Stefano Mazzocchi
8ce21461cb
getting closer to the desired functionality... still way too slow though
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@256 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 17:28:50 +00:00
Stefano Mazzocchi
50e58fb863
ngram-blocking gives more expected results... but slow as hell, maybe bug in the vptree code?
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@255 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 09:05:20 +00:00
Stefano Mazzocchi
546f87a536
let's try with another knn method
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@254 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 08:09:35 +00:00
David Huynh
977dbdb9ed
Fixed numeric range facet:
...
- use X icon for facet remove button
- update facet only when user finishes dragging slider bracket
git-svn-id: http://google-refine.googlecode.com/svn/trunk@250 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 07:01:13 +00:00
David Huynh
311d15f493
Re-organized column header popup menus and added a bunch of common facets and common cell edit transforms.
...
Added native syntax for regex in GEL and modified replace, split, partition, and rpartition functions to support regex. Removed function replaceRegex.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@249 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 06:57:08 +00:00
Stefano Mazzocchi
5b079b04b7
- moved from float to double to avoid excessive casting from secondstring
...
- added a few of the more powerful distances
- fixed a bug in the VPTree builder (although is still not working as I expect it to)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@248 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 05:11:36 +00:00
David Huynh
af9e9f590b
Fixed minor bug in facets of facet-based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@247 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:41:33 +00:00
David Huynh
4cc1933065
Got facets in facet-based edit dialog to update only after the user has finished dragging the slider bracket.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@246 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:33:07 +00:00
David Huynh
562b9d67a2
Customized brackets of slider widget so that they are asymmetric.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@245 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-09 00:14:01 +00:00
David Huynh
a3bcfc1576
Implemented facets on cluster metrics in facet-based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@244 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 23:05:44 +00:00
David Huynh
ff94de5900
Made dialogs draggable.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@243 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 21:50:53 +00:00
David Huynh
8633b20392
Fixed layout bug in expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@242 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 21:02:04 +00:00
David Huynh
6b421c2c75
In property suggest, bubble up properties of included types as well.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@241 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 20:56:04 +00:00
David Huynh
4a4ae6bf27
Fixed toTitlecase to handle parentheses and other delimiters.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@240 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:40:51 +00:00
David Huynh
c147837a3e
Cell and facet choice edit popups now allow Shift-Enter as a means to insert new lines. Plain Enter still applies the edit immediately.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@239 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:27:37 +00:00
David Huynh
0ef0aec0c5
Implemented list facet choice edit.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@238 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 19:07:16 +00:00
David Huynh
d731b89b4c
Back to Tahoma as the main font.
...
Tried more branding colors on dialog header background and tab widget header background.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@237 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 07:21:46 +00:00
David Huynh
6472e1f076
Re-layout when window is resized.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@236 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 06:48:12 +00:00
David Huynh
ac50b3c48b
Re-worked the cell editor popup.
...
Don't keep logging "Saved workspace."
git-svn-id: http://google-refine.googlecode.com/svn/trunk@235 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 06:30:47 +00:00
David Huynh
5d3a57eeeb
Implemented project import and export commands (from/to .tar files).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@234 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 02:34:25 +00:00
David Huynh
12d5c6aba5
Fixed layout of extract operation dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@233 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 00:45:29 +00:00
David Huynh
a1ec0ea8df
When saving projects, save only modified ones.
...
Save projects and workspace periodically.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@232 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 00:37:06 +00:00
David Huynh
3388c3e09f
Still some old Serializable stuff to remove.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@228 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 23:02:57 +00:00
David Huynh
80e6111a92
Added options for omitting error and blank choices in list facets, and use them in the various recon facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@227 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 22:54:02 +00:00
David Huynh
694f09fb0a
Major refactoring: everything is now saved to disk using our own formats, mostly json-based, some inside zip files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@226 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 22:37:26 +00:00
Stefano Mazzocchi
f7b0caa1b8
now kNN clustering is fully operational... not very practical though, needs more work and testing
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@225 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 08:27:13 +00:00
David Huynh
e06d8fe130
Better checking for null value in Cell.load.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@224 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 00:35:44 +00:00
David Huynh
e0d72c81e9
Renamed "facet-based edit" operation and command to "mass edit", because it's not just facet-based.
...
Added option "apply to other cells with same original content" to single cell edit popup, so it can be used like a find&replace operation.
Renamed "do-text-transform" operation and command to just "text-transform".
git-svn-id: http://google-refine.googlecode.com/svn/trunk@223 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-07 00:25:00 +00:00
David Huynh
b9308e4034
Added option to apply and recluster in the facet based edit dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@222 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:34:27 +00:00
David Huynh
02cd59a5c0
Display cluster sizes and number of cluster in facet-based edit dialog.
...
Added command to invoke that dialog from column popup menu, so you don't have to create a text facet first to get to it.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@221 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 23:01:46 +00:00
David Huynh
253874b1a1
Got Clusterer to use Column.name rather than Column.headerLabel now.
...
Tried using Verdana instead of Tahoma as the common font.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@220 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 22:48:42 +00:00
Stefano Mazzocchi
976c1da5c7
much improved facet clustering dialog and functionality
...
NOTE: kNN clustering code operational but is not working as expected
git-svn-id: http://google-refine.googlecode.com/svn/trunk@219 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 10:17:58 +00:00
David Huynh
db824bffeb
Fixed bug in saving recon changes.
...
Fixed bug in discard recon judgment operation.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@218 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 08:03:29 +00:00
David Huynh
78b1eb7e73
Major refactoring:
...
- Made all Change classes save to and load from .zip files.
- Changed Column.headerLabel to Column.name.
- Save project's raw data to "raw-data" file for now. We'll make it save to a zip file next.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@217 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-06 07:43:45 +00:00
David Huynh
589b9cd936
Re-organized popup menus for row operations. Added filter row.starred.
...
Disabled rendering of key column and column groups for now.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@216 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 22:31:47 +00:00
David Huynh
fe78fb8e30
A bit of branding and re-laying out the front page.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@215 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 22:09:25 +00:00
David Huynh
5c845f06bf
Now we can delete a project even if it hasn't been saved to file yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@214 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:43:08 +00:00
David Huynh
676a189968
Re-organized the .css files to match the .js files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@213 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:36:55 +00:00
David Huynh
7697365180
Added custom suggest for types as well. Styled property and type suggest panes to be wider to accommodate long IDs.
...
Fixed event wiring bugs in recon dialog.
Made cell edit popup wider and position it relative to the whole cell rather than to the edit link.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@212 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 19:28:35 +00:00
David Huynh
f1bf446019
Forgot to set white-space:pre on single cell edit popup.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@211 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 08:31:59 +00:00
David Huynh
b3ac945c33
Implemented single-cell editing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@210 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 08:11:48 +00:00
Stefano Mazzocchi
808fe61c8c
remove stuff that doesn't belong in version control
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@209 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 07:43:22 +00:00
David Huynh
40cdf5092b
Better display of Calendar objects in data table view and in expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@208 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 02:25:27 +00:00
David Huynh
87d20f3299
Fixed minor bug in numeric bin index where if a value was infinity, the bin count went negative.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@207 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 02:10:24 +00:00
Stefano Mazzocchi
37e37488ec
ability to delete a project from the front page
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@206 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:52:55 +00:00
David Huynh
2227dc8686
Delay injecting facet body into the DOM to speed things up a little.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@205 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:51:01 +00:00
David Huynh
4d893271ca
Fixed minor bug in cell UI that prevented single recon judgment operation from updating the UI.
...
Minor optimization: inject main table into the DOM only after it's completely built.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@204 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:42:21 +00:00
David Huynh
1d6db8fa6e
Made recon process cause the client page to create facets when the recon process is done.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@203 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-05 01:13:59 +00:00
Stefano Mazzocchi
32c0bf08c9
adding now() and inc() functions to the gel
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@202 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 20:53:07 +00:00
David Huynh
9d8b746121
Switched Cell.value from Object to Serializable.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@201 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:59:31 +00:00
David Huynh
3e0ac50e17
Fixed date parsing bug in index.js introduced since last commit.
...
Removed debugging console.log() call in browsing-engine.js.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@200 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:38:23 +00:00
David Huynh
1dc3d4abbd
Save project metadata to disk as JSON now rather than through Java serialization API.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@199 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 19:15:46 +00:00
Stefano Mazzocchi
409b451085
started work on protocol buffers
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@197 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 08:46:33 +00:00
David Huynh
22f226358d
Added pre-canned facets isBlank(value) and isError(value).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@196 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:32:30 +00:00
David Huynh
7487e81a4e
Fixed bug in expression preview dialog where selecting an old expression from the history pane did not take out the language prefixing, causing the expression to err.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@195 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:18:23 +00:00
David Huynh
6811f54f31
Fixed quoting bug in tripleloader transposer.
...
Implemented tripleloader exporter.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@194 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 07:02:03 +00:00
Stefano Mazzocchi
8f01da0aa8
fixing the date parser
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@193 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 04:06:38 +00:00
David Huynh
46ba4b94ca
Hooked up language select in expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@192 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:45:54 +00:00
Stefano Mazzocchi
1695e2f8f1
add the ngramFingerprint function
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@191 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:37:25 +00:00
Stefano Mazzocchi
5c3ca7723a
use a TreeSet to do both sorting and de-dupe of the split fragments
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@190 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:37:06 +00:00
David Huynh
70df6821a0
Made expression preview dialog for text transform operation also support repeat option.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@189 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:25:42 +00:00
Stefano Mazzocchi
cde6a02cbb
typo (needed to escape ' which is actually *not* an HTML entity, who knew)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@188 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:24:06 +00:00
David Huynh
87956be756
Minor bug: don't try to bind null cell value.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@187 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:14:40 +00:00
David Huynh
7eaf4b098c
Use .text() don't use .html() to render preview values.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@186 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 02:04:56 +00:00
David Huynh
9ca456411c
Use keyup instead of keypress, because when keypress gets fired, the new value isn't set yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@185 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:50:46 +00:00
David Huynh
1f05954924
Fixed regex text search facet to handle errors better. Use .text() rather than .html() to render cell values, or & will not show up.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@184 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:47:58 +00:00
David Huynh
72d06fe65c
Added support for canceling running and pending processes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@183 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 01:14:48 +00:00
David Huynh
eaef7b2394
Also let user decide what to do on expression evaluation error when creating a new column.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@182 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:32:54 +00:00
Stefano Mazzocchi
c07431fb88
- cataloged all the licenses for the libraries Gridworks depends on
...
- added the secondstring libraries that contains all sorts of useful string distance functions
- added a java arithmetic coding library (used to implement a string distance based on PPM arithmetic coding)
- added the vicino kNN string clustering library (from MIT's SIMILE)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@181 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:29:52 +00:00
David Huynh
5a0a8bea4f
Added custom dialog box for create column operation (with a field for the new column name).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@180 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:12:39 +00:00
David Huynh
2fe8f98e4e
Added repeat and repeatCount options for text transform operation. This lets us fix those & repeated encoding problems easily.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@179 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-04 00:00:46 +00:00
David Huynh
b4d2cef526
Added an option for what to do when a text transform errors out. Made a custom expression preview dialog for the text transform command in order to suppor that option.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@178 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 22:12:48 +00:00
David Huynh
a3181bbd8f
Made protograph UI link widget use the parent node's expected type to configure its property suggest.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@177 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 21:30:46 +00:00
David Huynh
c1498448e4
Implemented global and per-project expression histories and hooked them up to the expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@176 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 21:21:38 +00:00
David Huynh
91b10039ca
Started to redesign expression preview dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@175 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 18:08:35 +00:00
David Huynh
b75f1faea8
Changed tabs to spaces. No functionality change.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@174 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 04:19:58 +00:00
Stefano Mazzocchi
2691ee50d7
adding OS-specific data paths
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@173 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 02:53:07 +00:00
David Huynh
ad7671508f
Added "cancel processes" command, not hooked up yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@171 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-03 00:30:39 +00:00
David Huynh
2741d54984
Added custom suggest widget for properties.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@170 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:48:21 +00:00
David Huynh
59c5314e42
Fixed bug in list facet: list facets on columns with numeric data weren't working before.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@169 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 23:07:33 +00:00
David Huynh
b488d093c8
Added indicator for facet refreshing, and control for forcing facets to refresh.
...
Made facets drag-and-drop-able.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@168 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 22:32:52 +00:00
David Huynh
3ecfb4e4d9
Implemented facet-based edit operation for real.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@167 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 20:33:11 +00:00
David Huynh
d9507200f8
Started work on the facet-based edit dialog, which allows editing of cells based on their projections to facet choices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@166 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 19:58:47 +00:00
David Huynh
512cd16381
Implemented recon by keys, guids, and ids.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@165 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 18:19:20 +00:00
David Huynh
99ae6109d8
Started work on key-based recon.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@164 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 03:31:58 +00:00
David Huynh
e57aae888b
Hooked up the recon service at data.labs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@163 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-02 00:33:32 +00:00
David Huynh
f16727c20c
Refactored recon code on the server side to prepare for supporting other modes of recon.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@162 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 23:33:23 +00:00
David Huynh
a17882c44f
Redesigned the recon dialog. New features are not implemented yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@161 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 21:38:50 +00:00
Stefano Mazzocchi
621655372f
- save encoding and confidence in the project metadata
...
- use the saved encoding for decoding
- don't error when fingerprinting null
git-svn-id: http://google-refine.googlecode.com/svn/trunk@160 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 04:56:16 +00:00
David Huynh
bc9bc54d30
Implemented a meta parser that looks for a language prefix and picks the right parser.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@159 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:50:56 +00:00
David Huynh
acfa19a683
Moved GEL stuff (gridworks expression language) into gel package.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@158 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:30:31 +00:00
David Huynh
7c38fbb945
Created an ast package for gridworks expression language abstract syntax tree nodes. Moved parsing exception class out to its own file.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@156 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 01:14:21 +00:00
David Huynh
983be19e14
Made EvalError serializable because errors can be cell values and need to be saved.
...
Turned is* functions into controls, since they have to be able to test errors, and only controls can do that, not functions.
Polished display of errors in cells and in expression preview dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@155 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-01 00:21:13 +00:00
Stefano Mazzocchi
0c6590fe2c
- added an encoding guesser
...
- fixed a bunch of encoding issues
- added a function to reinterpret call content in another encoding
- added a 'phonetic' function to the expression language that supports metaphone and soundex
- updated the COS library to the latest released version
- added the IBM ICU4j library (that contains the encoding guesser)
- added examples with same content but different encodings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@154 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-28 21:51:33 +00:00
Stefano Mazzocchi
d9e67ac806
- diff now can act before two dates (still to be fully tested)
...
- added string fingerprinting function (useful for clustering)
- fixed unicode() function which wasn't returning correct values
- added a toString method to EvalError to know what error that was
- fixed a NPE in TextTransformationOperation
git-svn-id: http://google-refine.googlecode.com/svn/trunk@153 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-28 00:55:09 +00:00
Stefano Mazzocchi
f1923758e7
- add a bunch of new functions
...
- very lax date parser
- lots of new advanced string functions
- new version of commons-lang
git-svn-id: http://google-refine.googlecode.com/svn/trunk@152 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 08:56:04 +00:00
David Huynh
25fd5794cd
Added choices blank and error to list facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@151 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 06:59:55 +00:00
David Huynh
49e7241d1d
Re-organized functions into a few sub-packages.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@150 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 05:59:09 +00:00
David Huynh
c914aa6c16
Introduced EvalError objects as possible values returned by expressions.
...
Extracted function and control name mappings to ControlFunctionRegistry.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@148 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 05:48:33 +00:00
David Huynh
f0b8268809
Implemented permanent link, at least for facets' states.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@147 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-27 00:16:44 +00:00
David Huynh
f5ff9044cf
Track and display recon stats in column headers.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@146 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 23:33:16 +00:00
David Huynh
aadf98a388
Show choice count in list facet. Use close.png for close button.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@145 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 22:24:25 +00:00
David Huynh
c601bb6b0a
Re-organized .js files a bit.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@144 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 22:07:54 +00:00
David Huynh
30dce3b3d5
Made range facet more robust against bad expressions.
...
Centralized code that updates components of the UI. Show "Working..." indicator if anything takes more than 500ms.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@142 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 21:56:41 +00:00
David Huynh
1e4b9f4e80
Fixed bug in text search facet where if the query is null or empty string it'd filter to nothing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@141 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-26 18:44:36 +00:00
David Huynh
e2ccd491c5
New dmg background.
...
Added download site html.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@137 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 23:28:24 +00:00
David Huynh
c81913c6f8
Make sure process widget doesn't get stuck.
...
Enforce maximum height on process widget.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@136 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 19:48:11 +00:00
David Huynh
bb83dcda1c
Added support for specifying number of initial rows to skip when creating a new project.
...
Fixed the height of the histogram images in range facets to eliminate jitters.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@135 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-24 18:52:54 +00:00
David Huynh
254853b51d
Added reverse and sort functions.
...
Support a limit on how many rows to load into a new project.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@134 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 23:22:02 +00:00
David Huynh
e449617960
Fixed minor bug introduced in last change: recon candidates didn't get rendered at all.
...
Added "refresh" link for list facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@133 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 20:20:14 +00:00
David Huynh
2eb3a67125
Styled data table view summary a bit to highlight current number of rows.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@132 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 19:18:25 +00:00
David Huynh
4bdb2320b7
Styled help tab of expression preview dialog. Added variables section.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@131 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 19:06:58 +00:00
David Huynh
dce42400d4
Fixed bug introduced while trying to delay constructing the candidates arrays in Recon objects.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@130 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 17:47:34 +00:00
David Huynh
ec1604e815
Added support for starring rows.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@129 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 07:45:12 +00:00
David Huynh
92806cea7e
Forgot one instance of "gridlock" after project rename.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@128 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 01:49:41 +00:00
David Huynh
8992531d02
Documented functions and controls in expression language.
...
Better error checking in operator calls.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@127 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 01:40:06 +00:00
David Huynh
607fca04cb
Added a few more math functions.
...
Fixed expression preview dialog to use tabs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@126 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-23 00:33:39 +00:00
David Huynh
94fbd97bc4
Added a few more expression functions.
...
Bind row index when filtering rows, so we can create facets based on row indices.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@125 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:51:44 +00:00
David Huynh
0f505c72c5
Delay constructing the candidates array in recon objects to save memory.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@124 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:27:16 +00:00
David Huynh
c45e0edc10
Lower recon batch size back to 10.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@123 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:06:34 +00:00
David Huynh
4ed7b45e41
Don't use schema restriction for protograph link suggest because it's not a "soft" restriction (so if the user wants a property that doesn't belong to the type, there is no way to get it).
...
More expression functions and controls.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@122 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 23:04:46 +00:00
David Huynh
5e9be8c258
Support reusing newly created topics for cells with the same content.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@121 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 22:15:48 +00:00
David Huynh
e4b01cb36c
Make similar cell judgments an abstract operation.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@120 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 20:25:45 +00:00
David Huynh
c98a8ad552
Pulled the operations package up one level.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@119 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 18:42:25 +00:00
David Huynh
c50de52883
Improved the "extract operations" dialog to let user select which operations to extract. Also show history entries that cannot be abstracted.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@118 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 06:31:09 +00:00
David Huynh
1227c9dff4
Centralized mapping between operation names and their reconstructors.
...
Implemented comparison operators.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@117 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 04:23:25 +00:00
David Huynh
934c0f81c3
Forgot to check in image files.
...
Added commands for judging similar cells.
Started to fix/unify terminologies for recon operations.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@116 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-22 01:28:13 +00:00
David Huynh
b3167a1a9f
Added option for automatically approving best recon candidates that match the expected type and score at least some minimum score.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@115 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-21 08:11:07 +00:00
David Huynh
b4935f576c
Tweaked recon type guessing heuristic: remove "role" and "annotation" types, and rank types based on result orders rather than relevance scores.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@114 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-21 07:32:16 +00:00
David Huynh
b730dfd8f9
Added commands for searching for specific topics to match cells with.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@113 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-20 00:47:08 +00:00
David Huynh
ea2c904704
Use the schema index to suggest properties in the schema alignment dialog.
...
Fixed minor bug in triple loader transposer that wrote a bad triple for each literal cell value.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@112 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 22:56:29 +00:00
David Huynh
c4396798e3
Made schema alignment dialog shorter to fit on laptop screens.
...
Make sure main menu bar doesn't stay above dialog overlays after it gets deactivated.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@111 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 18:47:46 +00:00
David Huynh
846e540ff6
Keep track of type names of reconciled columns so we can display them later in the schema alignment dialog.
...
Automatically create properties linking to all columns when starting with an empty protograph.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@110 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 18:32:48 +00:00
David Huynh
acb0c91306
Re-organized column header popup menu.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@109 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 17:55:46 +00:00
David Huynh
6c7557eeff
Minor bug fixes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@108 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 17:47:04 +00:00
David Huynh
7f528c320d
Implemented a "busy" indicator dialog, useful for preparatory steps like guessing types before showing the recon dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@107 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 17:38:47 +00:00
David Huynh
bc412b99ea
Fixed bug in triple loader transposer: properties didn't get asserted before.
...
Made triple loader transposer index its output variables.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@106 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 07:05:09 +00:00
David Huynh
5264c829ae
A bit more careful error handling during recon.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@105 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 01:17:35 +00:00
David Huynh
28a86dfe0f
Automatically guess types to reconcile a column, using Stefano's trick in his "cupid" acre app.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@104 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 00:28:34 +00:00
David Huynh
4b2e48614b
Actual work in operations must be delayed until their changes are applied.
...
Column addition change must track the new cell index that it allocates when it is first applied.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@103 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 23:27:40 +00:00
David Huynh
8831703a2c
Implemented "apply operations" feature.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@102 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 05:00:56 +00:00
David Huynh
604dd53ebd
Engine configs were not deserialized properly when abstract operatons are retrieved.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@101 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:40:41 +00:00
David Huynh
b1afd12bd5
Forgot a console.log().
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@100 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:31:28 +00:00
David Huynh
8c41af9c12
Allow operations to be extracted in abstract forms.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@99 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:30:09 +00:00
David Huynh
32157ce76b
Changed operations to record column names instead of cell indices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@98 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 00:26:38 +00:00
David Huynh
e6a98f23bd
Implemented triple loader preview.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@97 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 23:32:12 +00:00
David Huynh
aa530395d2
Use tabs in the schema alignment dialog to get more space.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@96 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:45:04 +00:00
David Huynh
5de0c36f86
Protograph preview now works.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@95 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:15:19 +00:00
David Huynh
0d11e469a6
Added a preview pane in the schema alignment dialog, but it's not hooked up yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@94 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-13 01:08:12 +00:00
David Huynh
b17f71c8df
More work on schema alignment dialogs: schema data from Freebase is used to automatically fill in a lot of fields in the protograph.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@93 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-13 00:11:42 +00:00
David Huynh
8189ba74fd
Schema alignment dialog now saves protograph and re-renders it properly.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@92 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 22:11:49 +00:00
David Huynh
425140261f
We're starting to be able to save protographs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@91 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 20:29:56 +00:00
David Huynh
634d666949
More work on the schema alignment node dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@90 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 02:23:26 +00:00
David Huynh
f5942773ec
Still more work on the protograph, toward being able to build and save a protograph, but it's not working yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@89 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-11 06:44:48 +00:00
David Huynh
d227db0cc6
Eliminate hash maps from recon objects--they are expensive.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@88 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 19:25:21 +00:00
David Huynh
5cd147ea3c
Compute record indices and render them instead of row indices.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@87 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 18:54:53 +00:00
Stefano Mazzocchi
91d0133a76
- build a windows distribution
...
- update to the latest ant to use in case one is not present locally
git-svn-id: http://google-refine.googlecode.com/svn/trunk@85 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 07:30:16 +00:00
David Huynh
36e609d649
Minor fix to build.xml.
...
More work on schema alignment dialog: add another dialog for configuring a protograph node.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@84 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 06:39:55 +00:00
David Huynh
242e23c085
The schema alignment dialog is starting to work. The protograph gets rendered and is interactive. No saving yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@83 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 01:15:25 +00:00
David Huynh
39899b9eeb
Added topic block previews for recon candidates.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@82 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:36:47 +00:00
Stefano Mazzocchi
7510594d0d
build MacOSX desktop app and disk installer
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@81 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:21:10 +00:00
David Huynh
97e2e0eddc
Implemented "judge one cell" command for making recon judgment per cell.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@80 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:04:43 +00:00
David Huynh
66457f0515
Factored out column header UI and cell UI to prepare for single cell commands.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@79 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 19:23:11 +00:00
David Huynh
c73ec864db
A bit more prep work on rendering protograph nodes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@78 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 06:46:40 +00:00
David Huynh
f8a1daba62
Handle formula cells in Excel files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@77 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 01:13:11 +00:00
David Huynh
8802323839
Some more code for the schema alignment dialog box, but nothing is working yet, and I still need to think about how this should work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@76 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 00:45:54 +00:00
David Huynh
402858a784
Process widget was causing the UI to get rendered twice at start-up.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@75 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 00:26:57 +00:00
David Huynh
b7cf18b86a
Save a change right after it gets applied rather than when it gets created. This is because when a change gets applied, it might grab onto the old data in order to able to revert later, and we need to save that old data together with the change.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@74 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:57:37 +00:00
David Huynh
cd376c7532
Added support for Excel 2007 XML file format.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@73 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:44:33 +00:00
David Huynh
736c6ec1de
Added main menu bar.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@72 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 21:41:49 +00:00
Stefano Mazzocchi
2b985bf45a
moving json support in its own jar (code was taken today directly from json.org and compiled and packaged by me)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@70 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 20:46:27 +00:00
Stefano Mazzocchi
1f5b27653e
POI deprecated the use of short, good thing
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@69 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 19:54:09 +00:00
Stefano Mazzocchi
1343162a75
major rewrite of the foundation:
...
- de-maveniziation (uses the same code that Acre uses to drive jetty directly)
- removed all dependencies on external javascript code (jquery and suggest) by making a local copy (this makes gridworks totally self-serving, meaning that you can use it even if you don't have any internet connectivity)
- fixed a NPE when the servlet is shutdown before any project is loaded
- found a way to spawn a browser directly from the java code (untested in windows)
- added two ant tasks to generate windows and macosx stand-alone binaries (unused just yet)
To run, just type "./gridworks run" at the command line
git-svn-id: http://google-refine.googlecode.com/svn/trunk@65 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 23:15:50 +00:00
David Huynh
8f186a5f10
Added a help panel to the expression preview dialog. It gets populated by function and control names for now; more info will come later.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@64 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 09:02:22 +00:00
David Huynh
d3f97fea93
While importing data, use null for cells with empty text.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@63 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 07:16:39 +00:00
Stefano Mazzocchi
a61f35079a
make eclipse happier by removing @Override annotations when really it's an interface method implementation
...
(no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@62 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:47:52 +00:00
David Huynh
a025b272bd
String.isEmpty() is no longer there (?!).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@61 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:16:46 +00:00
David Huynh
2ceaa53390
Split css files.
...
Allow expressions to be evaluated on null cells too.
Various minor UI polishing touches and minor bug fixes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@58 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 03:36:40 +00:00
David Huynh
51de415809
Added recon config to column model so we can use that to do automatic schema alignment.
...
Fixed bug in recon operation that also tried to reconcile blank cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@57 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 01:07:45 +00:00
David Huynh
2eccab4c71
We need to recompute the row context dependencies after cell and column changes, but not after recon changes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@56 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 06:40:17 +00:00
David Huynh
bf79b24bf3
A bit more modeling for the proto-graph.
...
Two spinners to prepare for more async UI indications.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@55 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 02:54:01 +00:00
David Huynh
8701d504e6
More error checking.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@54 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-06 02:41:31 +00:00
David Huynh
149330fe05
Fixed bug in context row dependency calculations.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@53 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 23:32:29 +00:00
David Huynh
02b0c40558
Polished popup menus: popouts are positioned better to show the right arrows, expanded sub-menus are persistently highlighted in gray.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@52 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 23:24:52 +00:00
David Huynh
98a16ca500
Render contextual rows when filtered.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@51 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 23:05:00 +00:00
David Huynh
755b01c2c4
Keep number of bins below 100, or Google Charts won't work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@50 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 20:12:56 +00:00
David Huynh
764952865a
Fixed bug in range slider facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@49 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 20:05:44 +00:00
David Huynh
3e30ab2a33
Added length function and support for creating custom numeric (range slider) facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@48 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 19:43:43 +00:00
David Huynh
16dda46a61
Refactored importers, adding support for Excel files.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@47 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 19:19:38 +00:00
David Huynh
3ea230b11c
Minor bug fixes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@46 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 08:56:44 +00:00
David Huynh
8444136be1
Added control constructs to expression language.
...
Fixed bugs in multi-value cell split/join operations.
Condensed movies-condensed.tsv test data further.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@45 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 08:39:10 +00:00
David Huynh
937d91ab1d
Implement split and join multi-value cells features.
...
Reorganized column header popup menu.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@44 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 06:29:27 +00:00
David Huynh
62c8c72dc4
Added export row feature.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@43 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 01:58:06 +00:00
David Huynh
93f0dfd63a
Started to render column groups. Added a sample data set with CVTs.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@42 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-05 01:27:14 +00:00
David Huynh
f8e15798e2
Implemented column addition and removal features.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@41 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 23:38:40 +00:00
David Huynh
bacb71ab6d
Refactoring to support abstract operations.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@40 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 20:51:44 +00:00
David Huynh
9a63bdd821
Unified the way expression bindings are created, so everywhere we'll have the same variable bindings.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@38 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:30:56 +00:00
David Huynh
4ee1684fa3
Break the commands package into several for clearer organization.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@37 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:12:22 +00:00
David Huynh
720f147c09
Moved concrete change classes out to their own package. Added a few change classes for columns, but they are not hooked up yet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@36 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 06:00:26 +00:00
David Huynh
863493950f
Started to model the protograph for schema mapping.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@35 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 00:57:09 +00:00
David Huynh
44ea417d0d
Range slider facets now display sub-distribution when filtered by other facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@34 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-04 00:29:40 +00:00
David Huynh
21f2403146
Encapsulated a bunch of fields.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@33 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 21:57:38 +00:00
David Huynh
97fd9422f6
Started to add the key columns and column groups into the data model.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@32 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 20:55:48 +00:00
David Huynh
fb34d6f507
Implemented preview expression dialog.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@31 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 09:27:53 +00:00
David Huynh
a123b0840e
Styled the history widget and hopefully made it more comprehensible.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@30 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 07:54:23 +00:00
David Huynh
d107723b6b
Use real histograms, not dot line plots, for range slider facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@29 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 07:27:11 +00:00
David Huynh
4ec35cf5bb
A few find/replace from gridlock to gridworks.
...
Added list of existing projects on front page.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@28 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 06:23:40 +00:00
David Huynh
047f59e461
Renamed java packages.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@27 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 02:29:47 +00:00
David Huynh
c7693a9ac2
Polishing range slider facets' histograms.
...
Fixed scrolling bug in data table view.
Support range facets on for cell content as well.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@26 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 01:24:41 +00:00
David Huynh
ed5eae83af
Added histograms to range slider facets.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@25 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-03 00:20:42 +00:00
David Huynh
00696a96fc
Added commands to approve, approve new, and discard recon candidates.
...
Support sections in menus.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@24 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 20:50:35 +00:00
David Huynh
ac46653aa3
Added text search facet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@23 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 19:16:09 +00:00
David Huynh
f3d9a7804e
Added page size controls to data table view.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@22 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 18:43:19 +00:00
David Huynh
d054cc709f
Minor bug in recon type facet's expression.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@21 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 01:50:22 +00:00
David Huynh
f870333715
Added a bunch of custom recon facets. Implemented range slider facet.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@20 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-02 01:36:02 +00:00
David Huynh
58f2dd8f60
Recon process seems to be working. We can now filter rows by recon features using custom expressions.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@19 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 20:57:52 +00:00
David Huynh
fe8810aa07
Refactored to use JSONWriter to be more efficient at generating output.
...
Added created and modified time fields to project metadata.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@18 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 19:16:09 +00:00
David Huynh
17cbe6b62d
Added dialog system.
...
Started to implement recon dialog.
Added long-running recon process, which doesn't actually do reconciliation just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@17 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 08:31:50 +00:00
David Huynh
06b5373151
Client-side UI widget for long running processes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@16 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 03:22:35 +00:00
David Huynh
86f8c630ad
Facets can now be removed.
...
Minor polishing on history widget.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@15 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-01 00:19:41 +00:00
David Huynh
8b22eb594f
Refactoring and adding more functions.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@14 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-31 23:15:50 +00:00
David Huynh
e24d40c3da
Faceted browsing is starting to work.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@13 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-30 01:05:30 +00:00
David Huynh
dce2ec71aa
Started client-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@12 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-29 00:46:15 +00:00
David Huynh
e0365f45c8
More server-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@11 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-28 01:43:09 +00:00
David Huynh
23b9e313b8
Implemented expression parser.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@10 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 22:27:22 +00:00
David Huynh
6889d0e58a
Server-side scaffolding for faceted browsing.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@9 7d457c2a-affb-35e4-300a-418c747d4874
2010-01-27 07:52:05 +00:00