Commit Graph

518 Commits

Author SHA1 Message Date
David Huynh
72f1f0956e More polishing on the facet panel.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@498 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-18 00:56:09 +00:00
Stefano Mazzocchi
a115251992 polishing usability of the scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@496 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 19:21:39 +00:00
David Huynh
d61473e989 Added dispose method to facets so they can clean up themselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@495 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 06:58:30 +00:00
David Huynh
98f12544de Quick fix to prevent jittering in facet panel.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@494 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 06:40:42 +00:00
David Huynh
950abbc7ea Scatterplot facet now shows 2 layers of dots: one for the base distribution and one that's constrained by the other facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@493 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 06:19:47 +00:00
David Huynh
3b63e0b969 Scatterplot facet can now filter the rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@492 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 05:59:25 +00:00
Stefano Mazzocchi
85d7ed6b89 cleanup
git-svn-id: http://google-refine.googlecode.com/svn/trunk@491 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 03:02:34 +00:00
Stefano Mazzocchi
7a716a4a1b - updgraded commons-coded to the last version (needed for base64 encoding of data: uris)
- added the ability to embed the scatterplot inside the returned json data with data: uris (although it doesn't seem to work well)
- connected the selection logic to the scatterfacets (although it doesn't seem to filter the rows... and I'm puzzled as why)
- reduced cut/paste and code overlap between the scatterplot generator and the scatterplot facet


git-svn-id: http://google-refine.googlecode.com/svn/trunk@490 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 03:00:38 +00:00
David Huynh
8085208cf0 Fixed toTitlecase to handle fully capitalized text.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@489 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 23:42:52 +00:00
David Huynh
b7c3f99e3c Added help message in history panel.
Better resizing for left panel's tabs.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@488 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 23:38:49 +00:00
David Huynh
f1e05194e4 New column header popup menu button.
Reorganized column header menus.
Styled page size controls.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@487 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 22:39:30 +00:00
David Huynh
9e73a4e68c Started to work on a MARC importer. It doesn't work properly yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@486 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 19:52:01 +00:00
David Huynh
67662fcc96 Escape strings from TSV exporter.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@485 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 06:45:57 +00:00
Stefano Mazzocchi
1e5a787281 avoid ArrayOutOfBoundsException
git-svn-id: http://google-refine.googlecode.com/svn/trunk@484 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 20:24:23 +00:00
Stefano Mazzocchi
397861b612 - replace the 'cos' library with the apache 'commons-fileupload' for licensing reason (the cos library had a weird arm-twisting license that forced you to buy an o'reilly book on servlets for each developer in your company... good thing I read it all)
- some tweaks on imgareaselect's look


git-svn-id: http://google-refine.googlecode.com/svn/trunk@483 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 18:42:41 +00:00
Stefano Mazzocchi
93a8f78192 - updated to latest jquery (1.4.2)
- removed commons-math which I don't use anymore
- added imgareaselect
- added a bunch of licenses for the javascript libraries dependencies


git-svn-id: http://google-refine.googlecode.com/svn/trunk@482 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 06:56:07 +00:00
Stefano Mazzocchi
8cf69301a5 added a new command to get column metadata prior of creating the scatterplot half-matrix, this allows us to build a much more compact table and make the browser crawl a little less
git-svn-id: http://google-refine.googlecode.com/svn/trunk@481 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 06:20:56 +00:00
David Huynh
155b5a483a When deleting project dirs, we need to recurse into them ourselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@480 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:17:08 +00:00
David Huynh
1d938bc4d0 Better MQL batching during extending data operations.
Tried to use JSON streaming in changes as well.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@479 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:09:14 +00:00
David Huynh
2277f45ef6 For jython, wrap native values properly using Py.java2py().
git-svn-id: http://google-refine.googlecode.com/svn/trunk@478 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 21:30:39 +00:00
David Huynh
24a7ea91b6 Fixed bugs
- MassEditOperation was barfing when engineConfig was missing
- When parsing JSON in streaming mode, get long instead of int and double instead of float so that we won't get overflow exception.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@476 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 20:34:29 +00:00
Stefano Mazzocchi
ad1b920e0e better selection procedure for the scatterplot facet
(the scatterfacet itself still doesn't work though)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@474 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 05:14:33 +00:00
Stefano Mazzocchi
3bae823010 fixed eclipsed warning (no functional change)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@473 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:39:24 +00:00
Stefano Mazzocchi
7680d98b49 synch credits
git-svn-id: http://google-refine.googlecode.com/svn/trunk@472 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:14:46 +00:00
David Huynh
4a06c49a9a Added streaming json parser for faster re-loading of existing projects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@470 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 23:57:03 +00:00
David Huynh
a1a8758c37 Added options for specifying # lines the header columns take, and the # lines to skip processing entirely initially.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@468 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 21:23:41 +00:00
David Huynh
da55033df3 Forgot a console.log() in history-widget.js.
Updated view header to look like James' new design.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@467 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 08:06:05 +00:00
David Huynh
01fbca0fe6 Added help message when the facet panel is empty.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@466 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:52:01 +00:00
Stefano Mazzocchi
dc4b63d2bf forgot a piece
git-svn-id: http://google-refine.googlecode.com/svn/trunk@465 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:27:20 +00:00
Stefano Mazzocchi
de9dfbfba7 got scatterfacet to show \.o./
git-svn-id: http://google-refine.googlecode.com/svn/trunk@464 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:26:26 +00:00
David Huynh
84bc49b3d5 Got the history panel to resize properly in its tab.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@463 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:08:23 +00:00
Stefano Mazzocchi
ea2eff6117 transpose the scatterplot matrix (makes better use of space above the fold)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@462 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 06:40:33 +00:00
David Huynh
a2db5590ac Trim column names on import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@461 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 06:28:13 +00:00
David Huynh
a0b119b346 Polished list facet a bit more to look like James' design.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@460 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 05:41:32 +00:00
David Huynh
93a6be6ac6 More styling work on the facets/history tabs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@459 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 04:49:28 +00:00
David Huynh
666bcdbf8d Hooked scatterplots up to generate the right facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@458 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 04:48:20 +00:00
Stefano Mazzocchi
e232a90a73 progress but still no worky on the scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@457 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 03:44:42 +00:00
Stefano Mazzocchi
ba85f50e39 adding log-log support to the scatterplot matrix and more controls
(the scatterfacet still doesn't work but this is already more useful)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@456 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 00:25:43 +00:00
David Huynh
7391f760db Moved history widget and facet panel into tabs on the left according to James' new design. More polishing is needed.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@455 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 23:48:48 +00:00
David Huynh
8b95248c75 Fixed bug where after reconciling by ID, GUID, or key would generate a buggy numeric range facet, since all the scores were artificially the same.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@454 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 22:19:44 +00:00
Stefano Mazzocchi
7ab1acd801 skeleton code for scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@453 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 19:22:49 +00:00
David Huynh
f6dcdf23c4 Added temporary message for the Load into Freebase feature.
Fixed command Match Filtered Cells To, which didn't work because freebase suggest no longer returns types.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@449 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 03:59:19 +00:00
David Huynh
8fb23913ce Added "time" part option to datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@448 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:07:56 +00:00
David Huynh
ce8963d009 Added datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@447 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:01:34 +00:00
David Huynh
f7e830e709 Fixed bug in which editing a single cell and then starring the same row seemed to revert the cell back to its original content.
Added an option for not guessing cell value type during import.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@446 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 21:54:56 +00:00
David Huynh
75ea8304a3 In numeric range facets, show the other choices only if they have positive counts in the base distribution.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@445 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 08:19:05 +00:00
David Huynh
5928a689e2 Use RowParser for parsing the header row, too.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@444 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 03:42:44 +00:00
Stefano Mazzocchi
73288c5e2a Fixed a bug in case the property is not defined. (thanks to Iain Sproat for the patch)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@443 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 16:58:00 +00:00
Stefano Mazzocchi
81fb2f1740 first step at scatterplot facet selector
git-svn-id: http://google-refine.googlecode.com/svn/trunk@442 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:28:06 +00:00
Stefano Mazzocchi
ed0778f18d fixed a problem I introduced earlier with a jsunit scope fix
git-svn-id: http://google-refine.googlecode.com/svn/trunk@441 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:16:40 +00:00
Stefano Mazzocchi
a286b805d5 no reason to process the contextual rows while clustering
git-svn-id: http://google-refine.googlecode.com/svn/trunk@440 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:11:31 +00:00
David Huynh
70449cf7c8 Better error catching in toNumber function.
Watch out for the string "Infinity" while importing data sets: don't parse it into a double.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@438 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 21:59:50 +00:00
David Huynh
8950e87e02 When re-loading existing projects from disk, cache recon objects by their IDs to lower memory consumption.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@437 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 19:32:50 +00:00
David Huynh
dff870519e Made numeric range index handle value.log() properly when value is 0 or negative.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@436 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 05:24:46 +00:00
David Huynh
a0d8c385f9 Do a bit more checking when retrieving project metadata just in case project metadata is null.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@435 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 04:52:32 +00:00
Stefano Mazzocchi
4a139842d4 more jslint cleanups
git-svn-id: http://google-refine.googlecode.com/svn/trunk@434 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:48:50 +00:00
Stefano Mazzocchi
cc2209074b more jslint goodness
git-svn-id: http://google-refine.googlecode.com/svn/trunk@433 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:14:33 +00:00
Stefano Mazzocchi
16a2600a49 now it's jslint time to be happier: (!= null) -> (!== null)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@432 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:00:44 +00:00
Stefano Mazzocchi
d3d40d608a bunch of PMD-induced fixes
(now the PMD report is clean)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@430 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 00:14:11 +00:00
David Huynh
759824e1b4 Bug fix: editing one facet choice while some other choices are selected resulted in no change.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@429 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 22:16:12 +00:00
David Huynh
c4460fdfdd Made history panel expand on click rather than on hover.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@428 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 21:32:36 +00:00
Stefano Mazzocchi
7526c4e582 cleanups (no functional changes)
this makes pmd and javac on linux happier


git-svn-id: http://google-refine.googlecode.com/svn/trunk@427 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:46:02 +00:00
Will Moffat
fff6eb0aa2 JSLint: legimate comparisions to 0 and null should be made using ===
git-svn-id: http://google-refine.googlecode.com/svn/trunk@425 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:18:38 +00:00
Will Moffat
41a52f135d JSLint: foo == 0, foo == null --> !foo
git-svn-id: http://google-refine.googlecode.com/svn/trunk@424 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:16:08 +00:00
Will Moffat
94aa2e0168 JSLint: semicolons
git-svn-id: http://google-refine.googlecode.com/svn/trunk@423 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 19:52:23 +00:00
Will Moffat
fbc1d04efb JSLint fixes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@421 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 18:21:28 +00:00
David Huynh
bd24e5c7ff Set ajax_in_progress attribute on <body> during ajax operations in order to facilitate testing.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@416 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 21:31:50 +00:00
Will Moffat
efb9a43b73 Added: toDate() now supports formats.
My dataset has the same dates written in two styles: Nov-09 and 11/09
You can now parse this using: value.toDate2('MM/yy','MMM-yy').toString('yyyy-MM')



git-svn-id: http://google-refine.googlecode.com/svn/trunk@414 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 19:12:08 +00:00
David Huynh
302c27687c "type" in freebase suggest results got dropped so we need to fetch the result's types ourselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@409 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-07 18:23:07 +00:00
David Huynh
0996b9e1dd Gzip project export tar files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@394 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 22:28:30 +00:00
David Huynh
5320cc6587 Make duplicated column names unique during import by appending indices to them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@392 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 17:55:36 +00:00
David Huynh
9d9329ca96 Implemented row remove command.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@391 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:47:44 +00:00
David Huynh
1fd85c62bf Implemented column rename command.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@390 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:15:34 +00:00
David Huynh
a28a8d1769 Fixed bug in collapse and expand all columns commands.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@389 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 06:18:43 +00:00
David Huynh
93d6f9fc54 Better error message for bad regular expressions in GEL.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@388 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 06:18:07 +00:00
David Huynh
f402db10af Implemented inter-project joins.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@387 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 05:35:48 +00:00
Stefano Mazzocchi
771810bc0d avoid exception if there is only one extension in the whole archive
git-svn-id: http://google-refine.googlecode.com/svn/trunk@385 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 21:36:27 +00:00
Stefano Mazzocchi
2efbf0031f - removed the 'thirdparty' directory (now the 'gridworks' script will download and install needed tools if they are not present in the system already)
- added 'findbugs' command that uses the findbugs static analyzer to look for problems in the code
- fixed a bunch of issues that findbugs found (a few methods would go a little faster, and a few NPE will be avoided... nothing major but good to have)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@382 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 07:15:16 +00:00
Stefano Mazzocchi
798b2a36ca - archive and compressed file importer (supports zip, tar, gz, bz2, tar.gz and tar.bz2)
(works by loading the files that have the most common extensions in the archive)
- changed default max heap to 3Gb


git-svn-id: http://google-refine.googlecode.com/svn/trunk@381 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-04 07:48:47 +00:00
Stefano Mazzocchi
c24ec94835 had to shuffle around a bunch of classes to separate the main server classloader from the context classloader and allow reloading to happen for real
git-svn-id: http://google-refine.googlecode.com/svn/trunk@377 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-03 01:01:11 +00:00
Stefano Mazzocchi
72203cd3d5 - moved all code that contained MIT IP outside (http://code.google.com/p/simile-vicino/)
- moved bzip2 and tar code from apache ant into their own jar files
- now gridworks source contains only com.metaweb.* code everything else is a jar dependency
- started to work on archive importer


git-svn-id: http://google-refine.googlecode.com/svn/trunk@376 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 23:40:12 +00:00
Stefano Mazzocchi
4eda7ae2c0 avoid an array out of bounds exception in case there are no columns in the dataset
(I know, it should not happen but when it does let's not barf)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@375 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 20:21:41 +00:00
Stefano Mazzocchi
62f5f21ca3 atom is handled as well by the XML importer
git-svn-id: http://google-refine.googlecode.com/svn/trunk@374 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:44:05 +00:00
Stefano Mazzocchi
83faee3aa9 add a frame-less menu item in macosx to be able to open another gridworks browser window/tab in case we closed it by mistake
(no idea how to do this on windows, though, since there is no frame-less menu concept there)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@373 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:39:41 +00:00
Stefano Mazzocchi
521acda025 - pass the svn revision as format version (for more detailed verification)
- add an 'autoreload' setting that makes Gridworks autoreload its self if a class gets changed
(this is useful to make development cycles faster when working on the java code with autocompiling IDE like Eclipse or IDEA)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@372 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 00:52:38 +00:00
Stefano Mazzocchi
d1e72e7797 make the undo dialog closable
git-svn-id: http://google-refine.googlecode.com/svn/trunk@371 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 19:11:37 +00:00
Stefano Mazzocchi
988378c761 Hmm, String.split() bites us again: use the commons-lang one instead to avoid having to escape regexp values (this was preventing a user from splitting by "." in GEL)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@370 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 17:49:31 +00:00
Stefano Mazzocchi
0e07ec7acc crude, I know, but for now make Gridworks digest RDF/XML as it was XML (works surprisingly well, btw)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@369 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 16:56:38 +00:00
Stefano Mazzocchi
dced641599 - added the ability to specify the character separator for CSV or TSV files that don't use commas or tabs (this was needed to parse a dataset that we got from the BBC to try things out)
- used commons-lang split function instead of the java String.split one, this is necessary to avoid having to escape separators that might be confused for regexps


git-svn-id: http://google-refine.googlecode.com/svn/trunk@368 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 22:34:21 +00:00
Stefano Mazzocchi
77b452e87f adding version information to the about page
NOTE: this shows up only in the packaged distribution


git-svn-id: http://google-refine.googlecode.com/svn/trunk@367 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 19:20:32 +00:00
Stefano Mazzocchi
3c9af6501e more consistent naming and various polishing
git-svn-id: http://google-refine.googlecode.com/svn/trunk@364 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:52 +00:00
Stefano Mazzocchi
5884d257db default to 'relevance' instead of recon (which is faster). Change to recon if the user suggests schema hooks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@363 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:17 +00:00
Stefano Mazzocchi
571f2c9ab3 - better README
- made the build system obtain and use svn revision info directly in version.js
- fixed lunch4j initial memory usage
- added .ini support for .exe starting in windows
- more robust up-to-date logic that uses SVN revisions instead of dates
- connected to new freebase.com/labs/gridworks web site


git-svn-id: http://google-refine.googlecode.com/svn/trunk@362 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 00:17:40 +00:00
Stefano Mazzocchi
7c132cfa53 clean eclipse warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@357 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-29 20:51:18 +00:00
David Huynh
1d0e6abaf8 Got some work done on the plane:
- better detection of record XML elements in XML importer
- XML importer creates column groups and data table view renders them


git-svn-id: http://google-refine.googlecode.com/svn/trunk@356 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-27 05:23:09 +00:00
David Huynh
2a9fbd7d81 Made sure columns are named hierarchically in XML importer.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@355 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:15:10 +00:00
David Huynh
df7389876f First shot at XML import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@354 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:08:08 +00:00
David Huynh
4e76155652 Use application/x-unknown when exporting TSV so that the browser just saves the result. This is good for large exports, which overload the browser if the browser tries to display them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@353 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:51:27 +00:00
David Huynh
47cad64a3f Properly unescape \t, \r, \n, \\.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@352 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:50:44 +00:00
David Huynh
30e3ca4965 Added splitByLengths function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@351 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 04:30:31 +00:00
David Huynh
4df1c4107a Fixed a bug introduced recently: recon candidates were not serializing their topic types for the data view, so in the data view we can't send back a candidate's types when the user wants to match the candidate to some cells. I need to figure out a better way to optimize this.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@350 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 03:58:52 +00:00
David Huynh
00cce1b99a Styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@349 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 21:24:44 +00:00
David Huynh
c6cd48a6d2 Polished the path in the header pane.
Made dropdown menu graphics a little more subtle.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@348 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 21:12:13 +00:00
David Huynh
32e395d0e6 Updated version's date for another rolling release.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@347 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 19:43:06 +00:00
David Huynh
cbfa77dcaa Polished history widget. Now there's a link to roll the widget up.
Made sure busy dialog also has rounded corners on Chrome.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@346 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 19:30:50 +00:00
David Huynh
5c97177efd Added "reset" and "remove" links to facet panel.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@345 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 18:53:29 +00:00
David Huynh
85d1671d6e Fixed minor bug: recon wasn't saving out its candidates if its judgment is Matched. So when a project is saved and reloaded, it loses all of the recon candidates except for the matches.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@344 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 18:02:08 +00:00
David Huynh
ac57dea9c7 Do our own positioning of the process widget rather than using margin:auto, so that the links on the top header panel don't get obscured.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@343 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 04:48:05 +00:00
David Huynh
455802bffb Alert user of new version to download, if any.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@342 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 04:36:01 +00:00
David Huynh
2846d66261 Detect max cell index on load, just in case the max cell index we've stored previously was out of whack.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@341 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 03:19:17 +00:00
David Huynh
f8d30e9e8e Don't send back recon candidate types for rendering cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@340 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 01:17:45 +00:00
David Huynh
c07ba83a36 Don't send back recon candidate types for rendering cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@339 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 01:17:32 +00:00
David Huynh
3dc4db020f Support quick undo of the last operation (Ctrl-Z).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@338 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 00:26:28 +00:00
David Huynh
6d8776953d Added case sensitive and regex checkboxes to text search facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@337 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 23:01:53 +00:00
David Huynh
f5d270e35a Fixed "off by one bucket" bug in range facet's binning algorithm.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@336 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 22:04:29 +00:00
David Huynh
19ba207d27 Re-ordered the "other" choices in numeric range facets to make better use of space.
Changed main layout of whole application so that the horizontal scrollbar of the data table is visible without scrolling.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@335 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 21:48:36 +00:00
David Huynh
1d20b33cf1 Documented the history package.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@334 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 18:19:31 +00:00
David Huynh
ec0110d65b Documented gel.* packages.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@333 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 05:44:35 +00:00
David Huynh
60dd7eab82 Documented expr.* packages.
Converted some tabs into spaces.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@332 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 00:54:56 +00:00
David Huynh
60f60507f7 Fixed minor bug introduced recently into the Export Project menu command.
Documented the commands.* packages.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@331 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-21 20:26:35 +00:00
David Huynh
7648126a5e Made facets deal with java.util.Collection rather than just Object[].
Documented the browsing.* packages.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@330 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-21 07:14:39 +00:00
David Huynh
d90e75dff1 Started a round of documentation.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@329 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-20 23:56:28 +00:00
David Huynh
a43b2a72c1 Made various GEL functions and the forEach control work with java.util.List and java.util.Collection in addition to just Object[].
Added field columnNames to row object.
Added 1-bounded numeric log facet.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@328 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 23:04:17 +00:00
Vishal Talwar
6fba7d1e7f sped up jython evaluation by calling function directly instead of invoking parser on string representation of function call
git-svn-id: http://google-refine.googlecode.com/svn/trunk@327 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 21:45:01 +00:00
David Huynh
ff0049307e Increased file upload size limit to 1GB.
Fixed charset detector to be more robust in trying more than one charset.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@326 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 19:53:11 +00:00
David Huynh
fd85be7816 Added licenses to about page; styled it a bit.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@325 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 18:59:37 +00:00
David Huynh
8dd0dea472 Try to roundtrip reconciled IDs as much as possible when import/export as Excel files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@322 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 00:32:52 +00:00
David Huynh
5db6ee8ae5 When the user invokes the Export Tripleloader command, check if the protograph is null or not and alert accordingly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@321 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 23:35:54 +00:00
David Huynh
b54f7162a8 Made histogram widget capable of highlighting the selected range.
Added value.log() common numeric facet.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@320 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 22:58:06 +00:00
David Huynh
91241539cf Switched to a canvas-based implementation of histograms.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@319 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 20:45:52 +00:00
Stefano Mazzocchi
b9b4bb0ab4 better dropdown button that doesn't look disabled
git-svn-id: http://google-refine.googlecode.com/svn/trunk@318 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 01:47:03 +00:00
David Huynh
b7338e13f2 Tweaked column header menu dropdown icon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@317 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-18 00:35:34 +00:00
David Huynh
d56bbc1208 Renamed Protograph Node dialog's title to Schema Skeleton Node.
Made deleting a protograph link update the previews.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@316 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:55:40 +00:00
David Huynh
124960e756 Made "search for match" dialog commit on fb-select event.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@315 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:42:59 +00:00
David Huynh
07cf85b2a5 Added make_cli option for generating a zip containing all files necessary to do 'gridworks run' at the command line. This excludes Java source files but contains pretty much everything else.
Added make_all option that makes dmg, exe, and cli.

Added html and xls exporters.
Made exported files named after project names rather than project IDs.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@314 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 22:30:16 +00:00
David Huynh
07945f9cde A more helpful error message when the excel importer fails.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@313 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 21:43:58 +00:00
David Huynh
cd062cf028 Minor bug: recon candidate's "id" field should return id, not name.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@312 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 19:54:27 +00:00
David Huynh
b26160dc2b Hopefully a more robust way to get the user data dir on Windows, especially on Windows Vista 64-bit, which jdatapath.dll isn't built for.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@311 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 03:51:58 +00:00
David Huynh
b8519e42d6 Use non-breaking hyphens for "re-match" links.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@309 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 21:01:27 +00:00
David Huynh
999c18cae7 Better date/time format for projects' last modified fields.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@308 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 20:55:01 +00:00
David Huynh
1a8a236cdd Added an error page for when a project create operation fails.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@307 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 18:40:07 +00:00
David Huynh
798805edc5 More styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@306 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 18:11:08 +00:00
David Huynh
4e262f0e1d Styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@305 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 17:38:38 +00:00
Stefano Mazzocchi
ad6e8c2e0c add the ability to browse the values of a particular cluster
git-svn-id: http://google-refine.googlecode.com/svn/trunk@304 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:53:08 +00:00
David Huynh
084a6114d7 Track freebase types of columns added with data from Freebase, so that we can later add more data based on those columns. Fixed minor bug in serialization of data extension records.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@303 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:18:00 +00:00
Stefano Mazzocchi
cf95e5b5f6 freebase branding
git-svn-id: http://google-refine.googlecode.com/svn/trunk@302 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 01:46:32 +00:00
David Huynh
c6e7986206 Extend data operation is working.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@301 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 00:24:20 +00:00
David Huynh
2645c864ab We can now suggest CVT properties.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@300 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 20:41:08 +00:00
David Huynh
c30a5126df More work on the extend data preview dialog: columns can now be removed.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@299 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 19:44:33 +00:00
David Huynh
d0f77a5ef8 Minor layout tweak in clustering dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@298 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-15 18:48:39 +00:00