825 Commits

Author SHA1 Message Date
Stefano Mazzocchi
8cf69301a5 added a new command to get column metadata before creating the scatterplot half-matrix; this allows us to build a much more compact table and makes the browser crawl a little less
git-svn-id: http://google-refine.googlecode.com/svn/trunk@481 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 06:20:56 +00:00
David Huynh
155b5a483a When deleting project dirs, we need to recurse into them ourselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@480 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:17:08 +00:00
David Huynh
1d938bc4d0 Better MQL batching during extending data operations.
Tried to use JSON streaming in changes as well.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@479 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:09:14 +00:00
David Huynh
2277f45ef6 For jython, wrap native values properly using Py.java2py().
git-svn-id: http://google-refine.googlecode.com/svn/trunk@478 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 21:30:39 +00:00
Will Moffat
6be29f712c Fix previous fix.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@477 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 21:18:27 +00:00
David Huynh
24a7ea91b6 Fixed bugs
- MassEditOperation was barfing when engineConfig was missing
- When parsing JSON in streaming mode, get long instead of int and double instead of float so that we won't get overflow exception.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@476 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 20:34:29 +00:00
Will Moffat
70c6f901ac Fix: build on mac
git-svn-id: http://google-refine.googlecode.com/svn/trunk@475 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 18:17:15 +00:00
Stefano Mazzocchi
ad1b920e0e better selection procedure for the scatterplot facet
(the scatterfacet itself still doesn't work though)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@474 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 05:14:33 +00:00
Stefano Mazzocchi
3bae823010 fixed Eclipse warning (no functional change)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@473 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:39:24 +00:00
Stefano Mazzocchi
7680d98b49 synch credits
git-svn-id: http://google-refine.googlecode.com/svn/trunk@472 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:14:46 +00:00
Stefano Mazzocchi
2e1c8e688c bunch of fixes to make this script also work under cygwin
(this is useful mostly to run automated ui testing on windows)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@471 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 01:37:56 +00:00
David Huynh
4a06c49a9a Added streaming json parser for faster re-loading of existing projects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@470 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 23:57:03 +00:00
Will Moffat
8142b27ee4 Fixed: Mac OS X defaults to Java 5; export JAVA_HOME to point to Java 6
git-svn-id: http://google-refine.googlecode.com/svn/trunk@469 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 22:14:02 +00:00
David Huynh
a1a8758c37 Added options for specifying the number of lines the header columns take, and the number of initial lines to skip entirely.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@468 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 21:23:41 +00:00
David Huynh
da55033df3 Forgot a console.log() in history-widget.js.
Updated view header to look like James' new design.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@467 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 08:06:05 +00:00
David Huynh
01fbca0fe6 Added help message when the facet panel is empty.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@466 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:52:01 +00:00
Stefano Mazzocchi
dc4b63d2bf forgot a piece
git-svn-id: http://google-refine.googlecode.com/svn/trunk@465 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:27:20 +00:00
Stefano Mazzocchi
de9dfbfba7 got scatterfacet to show \.o./
git-svn-id: http://google-refine.googlecode.com/svn/trunk@464 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:26:26 +00:00
David Huynh
84bc49b3d5 Got the history panel to resize properly in its tab.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@463 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:08:23 +00:00
Stefano Mazzocchi
ea2eff6117 transpose the scatterplot matrix (makes better use of space above the fold)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@462 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 06:40:33 +00:00
David Huynh
a2db5590ac Trim column names on import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@461 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 06:28:13 +00:00
David Huynh
a0b119b346 Polished list facet a bit more to look like James' design.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@460 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 05:41:32 +00:00
David Huynh
93a6be6ac6 More styling work on the facets/history tabs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@459 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 04:49:28 +00:00
David Huynh
666bcdbf8d Hooked scatterplots up to generate the right facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@458 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 04:48:20 +00:00
Stefano Mazzocchi
e232a90a73 progress but still no worky on the scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@457 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 03:44:42 +00:00
Stefano Mazzocchi
ba85f50e39 adding log-log support to the scatterplot matrix and more controls
(the scatterfacet still doesn't work but this is already more useful)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@456 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 00:25:43 +00:00
David Huynh
7391f760db Moved history widget and facet panel into tabs on the left according to James' new design. More polishing is needed.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@455 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 23:48:48 +00:00
David Huynh
8b95248c75 Fixed bug where reconciling by ID, GUID, or key would generate a buggy numeric range facet, since all the scores were artificially the same.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@454 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 22:19:44 +00:00
Stefano Mazzocchi
7ab1acd801 skeleton code for scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@453 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 19:22:49 +00:00
Stefano Mazzocchi
e154a7e02d increase the thread count... it seems high but I think browsers heavily cache HTTP connections and using multiple browsers can actually starve us
git-svn-id: http://google-refine.googlecode.com/svn/trunk@452 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 19:22:23 +00:00
Stefano Mazzocchi
60d61b7808 add commons-math library (I'm going to need this for more advanced facets)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@451 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 04:25:50 +00:00
Stefano Mazzocchi
52b4fd556f more robust checking of java presence
git-svn-id: http://google-refine.googlecode.com/svn/trunk@450 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 04:24:17 +00:00
David Huynh
f6dcdf23c4 Added temporary message for the Load into Freebase feature.
Fixed command Match Filtered Cells To, which didn't work because freebase suggest no longer returns types.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@449 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 03:59:19 +00:00
David Huynh
8fb23913ce Added "time" part option to datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@448 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:07:56 +00:00
David Huynh
ce8963d009 Added datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@447 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:01:34 +00:00
David Huynh
f7e830e709 Fixed bug in which editing a single cell and then starring the same row seemed to revert the cell back to its original content.
Added an option for not guessing cell value type during import.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@446 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 21:54:56 +00:00
David Huynh
75ea8304a3 In numeric range facets, show the other choices only if they have positive counts in the base distribution.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@445 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 08:19:05 +00:00
David Huynh
5928a689e2 Use RowParser for parsing the header row, too.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@444 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 03:42:44 +00:00
Stefano Mazzocchi
73288c5e2a Fixed a bug in case the property is not defined. (thanks to Iain Sproat for the patch)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@443 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 16:58:00 +00:00
Stefano Mazzocchi
81fb2f1740 first step at scatterplot facet selector
git-svn-id: http://google-refine.googlecode.com/svn/trunk@442 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:28:06 +00:00
Stefano Mazzocchi
ed0778f18d fixed a problem I introduced earlier with a jsunit scope fix
git-svn-id: http://google-refine.googlecode.com/svn/trunk@441 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:16:40 +00:00
Stefano Mazzocchi
a286b805d5 no reason to process the contextual rows while clustering
git-svn-id: http://google-refine.googlecode.com/svn/trunk@440 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:11:31 +00:00
Stefano Mazzocchi
924f520fbf turns out we have been distributing the mac version of gridworks with the default JVM max_heap settings... no wonder so many people were getting OutOfMemory errors
git-svn-id: http://google-refine.googlecode.com/svn/trunk@439 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 01:36:35 +00:00
David Huynh
70449cf7c8 Better error catching in toNumber function.
Watch out for the string "Infinity" while importing data sets: don't parse it into a double.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@438 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 21:59:50 +00:00
David Huynh
8950e87e02 When re-loading existing projects from disk, cache recon objects by their IDs to lower memory consumption.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@437 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 19:32:50 +00:00
David Huynh
dff870519e Made numeric range index handle value.log() properly when value is 0 or negative.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@436 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 05:24:46 +00:00
David Huynh
a0d8c385f9 Do a bit more checking when retrieving project metadata just in case project metadata is null.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@435 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 04:52:32 +00:00
Stefano Mazzocchi
4a139842d4 more jslint cleanups
git-svn-id: http://google-refine.googlecode.com/svn/trunk@434 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:48:50 +00:00
Stefano Mazzocchi
cc2209074b more jslint goodness
git-svn-id: http://google-refine.googlecode.com/svn/trunk@433 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:14:33 +00:00
Stefano Mazzocchi
16a2600a49 now it's jslint time to be happier: (!= null) -> (!== null)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@432 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 01:00:44 +00:00