Commit Graph

377 Commits

Author SHA1 Message Date
David Huynh
6450921c02 Fixed issue 4: Match All bug with ZIP code.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@767 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-14 17:03:33 +00:00
David Huynh
bd87e079b2 Fixed issue 34: Behavior of Text Filter is unpredictable when "regular expression" mode is enabled.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@766 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-14 16:51:31 +00:00
Stefano Mazzocchi
ea459aed07 Applied a bunch of patches from Tom Morris (Issue 25, 26 and 27)
- make java6 dependency explicit in eclipse project files
- avoid using NotImplementException especially the sun.* one
- avoid using internal sun signal handling and rely on standard java.* APIs
 (I tested this one and it seems to be working fine)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@756 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-13 21:02:19 +00:00
Iain Sproat
a096fae7a0 ParsingUtilities.evaluateJsonStringToObject is now unit tested
CommandTests and CancelProcessesCommandTests are slightly formatted to match code standards.
Command and CancelProcessesCommand are slightly tweaked to pass under all unit tests.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@753 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-13 09:23:37 +00:00
Stefano Mazzocchi
5070de6446 committing Iain's new tests and polished things a little
(also added a test-specific log4j conf so that we don't show all the warnings during the tests that were distracting)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@744 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-13 00:54:02 +00:00
Stefano Mazzocchi
1b9cfbbf90 detabbing (no functional changes)
David, you might want to check your editor settings, you're mixing tabs with spaces


git-svn-id: http://google-refine.googlecode.com/svn/trunk@724 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 09:02:41 +00:00
Stefano Mazzocchi
11da70d223 Applying patch for Issue 21 from iainsproat
git-svn-id: http://google-refine.googlecode.com/svn/trunk@722 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 08:59:05 +00:00
Stefano Mazzocchi
fe0afa0bc3 Fixed Issue #18
git-svn-id: http://google-refine.googlecode.com/svn/trunk@721 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 07:53:26 +00:00
David Huynh
8412aa72dd Fixed Issue 17: Conflated triples - all rows are producing triple with "s" :" $Name_0".
Also exposed "id" field for recon objects.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@720 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 07:45:22 +00:00
David Huynh
992cb964eb Fixed Issue 2: Undo History bug. The MassChange class didn't write out each elemental change's class and so they couldn't be read back in.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@719 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 06:35:01 +00:00
David Huynh
7ad438078f Fixed Issue 19: CSV import is too basic. Quoted cells in a CSV file can now contain line breaks. Added a CSV file to test that.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@717 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 06:07:44 +00:00
Stefano Mazzocchi
a643eabc88 fixed eclipse warning
git-svn-id: http://google-refine.googlecode.com/svn/trunk@710 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-11 18:18:07 +00:00
David Huynh
f341fa8db2 Fixed Issue 13: float rejected from sandbox upload as Json object.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@708 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-11 06:45:49 +00:00
David Huynh
114be582e8 Fixed issue 5: Localized Windows cause save problems for Gridworks. jdatapath library returns a path in which each unicode character (in the user ID) has been replaced by ?. So we need to grab the actual user ID from environment variables such as APPDATA or USERPROFILE.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@707 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-11 04:16:26 +00:00
Stefano Mazzocchi
a626ea51ff if the user's computer's clock is not synchronized with Freebase's, the initial oauth handshake will fail because Freebase believe there is a replay attack going on. To avoid that we ask Acre to tell us what time it is in Freebase land and we sign the request with that instead. We default back to the user clock only if Acre is down or swamped.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@704 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-10 23:24:58 +00:00
David Huynh
60ef31012a Check for missing "target" in protograph link node's serialized json object when reloading from project files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@702 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-10 20:49:45 +00:00
David Huynh
2161255cdb Fixed issue 11: missing "lang" attribute in MQL generated in schema alignment.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@700 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-10 20:08:53 +00:00
David Huynh
9fa9401ccb Fixed a bug in which if the target of a protograph link was somehow missing, the project.html wouldn't load.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@665 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-09 19:35:42 +00:00
David Huynh
510df55ea5 Oops, forgot to change the autosave period back to 5 minutes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@628 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-09 04:37:36 +00:00
David Huynh
8c03c1ddcf Prevent autosave timer events from bunching up when the computer is put into sleep mode.
Don't autosave while creating or importing projects, exporting rows, or uploading data to Freebase. Those are potentially intensive operations.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@627 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-09 04:34:36 +00:00
David Huynh
33895a7a6d In scatterplot facets, made sure in log/log mode no point falls in the negative quadrants.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@615 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-06 22:49:19 +00:00
David Huynh
fae6701493 Added support for exporting a scatterplot facet's image as a large image.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@614 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-06 20:44:40 +00:00
David Huynh
f37d24e04a Scatterplot facets:
- Changed coordinate systems of selection to zero to one, as fraction of plot's size
- Used affine transform to do proper plot rotation
- Filled negative quadrants of plot to indicate rotation
- Set default dot size of actual facet's plot to regular

git-svn-id: http://google-refine.googlecode.com/svn/trunk@612 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-06 19:34:51 +00:00
Stefano Mazzocchi
fc5dd69c33 wrong fix
git-svn-id: http://google-refine.googlecode.com/svn/trunk@602 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-05 01:50:54 +00:00
Stefano Mazzocchi
c32899aea6 clearing PMD warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@600 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-05 01:42:08 +00:00
Stefano Mazzocchi
e6d36710ff findbug cleanups
git-svn-id: http://google-refine.googlecode.com/svn/trunk@599 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-05 01:35:51 +00:00
Stefano Mazzocchi
92ecc0c0f5 detab + dedos for java files (no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@594 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-04 23:24:48 +00:00
David Huynh
883fc65304 Minor bug: blank maxColumns param caused SplitColumnCommand to throw an exception.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@587 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-04 16:14:13 +00:00
David Huynh
9641f28fe0 Made extend data from freebase command available on columns not officially reconciled, since some columns might contain reconciled data copied from other columns.
More error checking in the extend data utility.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@583 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-02 22:27:17 +00:00
David Huynh
d303adc48e Made data upload dialog shows only limited preview of triples, but made actual uploading process and the tripleloader exporter generate all triples. Added spinner busy dialog during uploading process.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@582 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-01 00:11:44 +00:00
Stefano Mazzocchi
b6ed5a3df5 moving to freeq for real
git-svn-id: http://google-refine.googlecode.com/svn/trunk@581 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 23:21:17 +00:00
David Huynh
80a3e62d3e Display progress percentage in window title while some long running process is occurring.
Expose recon candidates to expressions.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@579 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 22:49:59 +00:00
David Huynh
bab1e8905b Jacked up jetty form upload size limit.
Added a few more array bound checks.
Reduced number of recon candidate and recon objects created by extend data operations.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@577 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 19:41:53 +00:00
Stefano Mazzocchi
ce40122754 - fixed oauth problem with non-127.0.0.1 hosts
- fixed scatterfacet filtering consistency
- increased size of scatterplot in the scatterfacet


git-svn-id: http://google-refine.googlecode.com/svn/trunk@573 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 06:12:42 +00:00
David Huynh
3f40195ea1 Implemented but disabled the denormalize operation.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@571 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-29 22:07:07 +00:00
David Huynh
270f147d92 Support inverting list facet selection.
Make sure blank and error selections are saved and restored in permanent links.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@570 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-29 21:22:28 +00:00
David Huynh
17c9b65889 Made (blank) and (error) choices in list facets editable, too.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@569 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-29 20:18:06 +00:00
David Huynh
cf01dcd965 In column addition and text transform operations, for expressions that evaluate to cells or wrapped cells, use the whole cells as the result cells. This effectively copies their recon objects as well.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@568 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-29 19:44:39 +00:00
David Huynh
890cbffb67 Added recon metadata to tripleloader payload.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@564 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-28 22:04:39 +00:00
David Huynh
89e1d8b5ac Got history entries' IDs into Recon objects so we can track from a Recon object to all others created by the same operation.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@562 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-28 17:18:44 +00:00
David Huynh
4f1aaff486 Support new fields for Recon objects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@561 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 23:21:21 +00:00
David Huynh
15c188ad7a Added more metadata into recon objects.
Tried to minimize number of unique recon objects created when calling Recon.dup().

git-svn-id: http://google-refine.googlecode.com/svn/trunk@560 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 23:17:18 +00:00
David Huynh
e77b99e58b For relevance service, auto-match only if the type matches, the score is at least 100, and if there is more than one result, the ratio of the first result's score over the second result's score must be at least 1.5.
For recon service, auto-match only if the result has match:true.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@559 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 21:38:47 +00:00
David Huynh
ca2bc0a304 Fixed null pointer exception problem in HeuristicReconConfig when trying to use "recon" service.
Made custom suggest widget rely on gridworks-helper acre app for fetching property suggestions.
Made various property suggest in recon dialog use our custom suggest widget.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@557 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 20:34:36 +00:00
Stefano Mazzocchi
5cd0301e57 make sure that users can't bypass easily the upload badge checks simply by tweaking dom values from firebug
git-svn-id: http://google-refine.googlecode.com/svn/trunk@556 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 19:47:12 +00:00
Stefano Mazzocchi
b1375a8997 more polish
git-svn-id: http://google-refine.googlecode.com/svn/trunk@555 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 08:13:45 +00:00
Stefano Mazzocchi
0eb18633e6 implemented more conservative data loading workflow
git-svn-id: http://google-refine.googlecode.com/svn/trunk@554 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 07:13:11 +00:00
Stefano Mazzocchi
e6012bc14a Fixes for Freeq
git-svn-id: http://google-refine.googlecode.com/svn/trunk@552 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 00:18:31 +00:00
David Huynh
fece6187bf Jython libraries should now be properly imported on Windows as well.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@551 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 17:57:52 +00:00
David Huynh
53d7bd3287 Another star to flag copy and paste bug.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@549 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 01:18:57 +00:00
David Huynh
3ae72ea630 Minor bug.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@548 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 01:15:02 +00:00
David Huynh
fed3c87fa6 Added row flagging support. Fixed bug in row star change: starring or unstarring one row wasn't undo-able previously.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@547 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 01:08:56 +00:00
David Huynh
a734a9c6cb Initialize the jython library with the custom lib/jython/ path if we're running as a packaged app on Mac OSX.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@542 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-25 20:05:43 +00:00
David Huynh
007da57c1e More work on the extend data dialog. The suggested properties are now populated by the gridworks-helper acre app. Constraints can be specified per column, in the free form of a MQL query. It's a temporary solution.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@540 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-25 06:39:03 +00:00
Stefano Mazzocchi
1a6d1cf6b2 more polish
git-svn-id: http://google-refine.googlecode.com/svn/trunk@528 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-24 02:36:56 +00:00
David Huynh
0778b324de Made facets' expressions editable.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@527 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-24 01:19:13 +00:00
David Huynh
f9a829758e Pool recons and recon candidates. This yields smaller project files, change files, and AJAX responses for get-rows. It should make re-loading existing projects faster.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@521 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-23 19:39:12 +00:00
Stefano Mazzocchi
3e37970540 polishing (no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@520 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-23 17:30:18 +00:00
Stefano Mazzocchi
6990604981 implemented the full gridworks -> freebase conduit via delegated oauth and freeq/tripleloader
(still doesn't work as argus returns a 500 but the entire conduit is in place)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@519 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-23 08:25:52 +00:00
Stefano Mazzocchi
439474caeb Checkpoint for OAuth functionality in Gridworks
(doesn't work but since it's a substantial chunk of stuff, I want to get it in sooner rather than later)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@516 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-21 21:08:34 +00:00
David Huynh
5ba67b7b26 Implemented column split command. It seems to be working in "by lengths" mode.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@510 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-20 23:08:14 +00:00
Vishal Talwar
d0df704d8a added python code part of jython distribution in lib/jython-2.5.1
added python.path vm arg to startup script
fixed infinite loop in unwrap() when displaying sequences of sequences



git-svn-id: http://google-refine.googlecode.com/svn/trunk@509 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-20 18:50:24 +00:00
David Huynh
35da36b0e8 Fixed misspell in clustering dialog.
Added option for not splitting lines into columns on import.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@508 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-20 07:26:07 +00:00
David Huynh
d85a0e1851 Retrieve dates correctly from Excel files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@507 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-20 04:43:39 +00:00
David Huynh
2226d77c27 Oops, minor bug in range facet introduced in last check-in.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@504 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-19 23:45:18 +00:00
David Huynh
d1b0de95de Made our own slider widget to use in conjunction with our histogram widget.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@503 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-19 23:27:57 +00:00
David Huynh
72f1f0956e More polishing on the facet panel.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@498 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-18 00:56:09 +00:00
David Huynh
3b63e0b969 Scatterplot facet can now filter the rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@492 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 05:59:25 +00:00
Stefano Mazzocchi
85d7ed6b89 cleanup
git-svn-id: http://google-refine.googlecode.com/svn/trunk@491 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 03:02:34 +00:00
Stefano Mazzocchi
7a716a4a1b - updgraded commons-coded to the last version (needed for base64 encoding of data: uris)
- added the ability to embed the scatterplot inside the returned json data with data: uris (although it doesn't seem to work well)
- connected the selection logic to the scatterfacets (although it doesn't seem to filter the rows... and I'm puzzled as why)
- reduced cut/paste and code overlap between the scatterplot generator and the scatterplot facet


git-svn-id: http://google-refine.googlecode.com/svn/trunk@490 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-17 03:00:38 +00:00
David Huynh
8085208cf0 Fixed toTitlecase to handle fully capitalized text.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@489 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 23:42:52 +00:00
David Huynh
9e73a4e68c Started to work on a MARC importer. It doesn't work properly yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@486 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 19:52:01 +00:00
David Huynh
67662fcc96 Escape strings from TSV exporter.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@485 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-16 06:45:57 +00:00
Stefano Mazzocchi
1e5a787281 avoid ArrayOutOfBoundsException
git-svn-id: http://google-refine.googlecode.com/svn/trunk@484 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 20:24:23 +00:00
Stefano Mazzocchi
397861b612 - replace the 'cos' library with the apache 'commons-fileupload' for licensing reason (the cos library had a weird arm-twisting license that forced you to buy an o'reilly book on servlets for each developer in your company... good thing I read it all)
- some tweaks on imgareaselect's look


git-svn-id: http://google-refine.googlecode.com/svn/trunk@483 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 18:42:41 +00:00
Stefano Mazzocchi
8cf69301a5 added a new command to get column metadata prior of creating the scatterplot half-matrix, this allows us to build a much more compact table and make the browser crawl a little less
git-svn-id: http://google-refine.googlecode.com/svn/trunk@481 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 06:20:56 +00:00
David Huynh
155b5a483a When deleting project dirs, we need to recurse into them ourselves.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@480 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:17:08 +00:00
David Huynh
1d938bc4d0 Better MQL batching during extending data operations.
Tried to use JSON streaming in changes as well.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@479 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:09:14 +00:00
David Huynh
2277f45ef6 For jython, wrap native values properly using Py.java2py().
git-svn-id: http://google-refine.googlecode.com/svn/trunk@478 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 21:30:39 +00:00
David Huynh
24a7ea91b6 Fixed bugs
- MassEditOperation was barfing when engineConfig was missing
- When parsing JSON in streaming mode, get long instead of int and double instead of float so that we won't get overflow exception.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@476 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 20:34:29 +00:00
Stefano Mazzocchi
3bae823010 fixed eclipsed warning (no functional change)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@473 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:39:24 +00:00
David Huynh
4a06c49a9a Added streaming json parser for faster re-loading of existing projects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@470 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 23:57:03 +00:00
David Huynh
a1a8758c37 Added options for specifying # lines the header columns take, and the # lines to skip processing entirely initially.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@468 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 21:23:41 +00:00
Stefano Mazzocchi
dc4b63d2bf forgot a piece
git-svn-id: http://google-refine.googlecode.com/svn/trunk@465 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 07:27:20 +00:00
David Huynh
a2db5590ac Trim column names on import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@461 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 06:28:13 +00:00
Stefano Mazzocchi
e232a90a73 progress but still no worky on the scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@457 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 03:44:42 +00:00
Stefano Mazzocchi
ba85f50e39 adding log-log support to the scatterplot matrix and more controls
(the scatterfacet still doesn't work but this is already more useful)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@456 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 00:25:43 +00:00
David Huynh
8b95248c75 Fixed bug where after reconciling by ID, GUID, or key would generate a buggy numeric range facet, since all the scores were artificially the same.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@454 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 22:19:44 +00:00
Stefano Mazzocchi
7ab1acd801 skeleton code for scatterfacet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@453 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-12 19:22:49 +00:00
David Huynh
8fb23913ce Added "time" part option to datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@448 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:07:56 +00:00
David Huynh
ce8963d009 Added datePart function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@447 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 23:01:34 +00:00
David Huynh
f7e830e709 Fixed bug in which editing a single cell and then starring the same row seemed to revert the cell back to its original content.
Added an option for not guessing cell value type during import.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@446 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 21:54:56 +00:00
David Huynh
75ea8304a3 In numeric range facets, show the other choices only if they have positive counts in the base distribution.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@445 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 08:19:05 +00:00
David Huynh
5928a689e2 Use RowParser for parsing the header row, too.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@444 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-11 03:42:44 +00:00
Stefano Mazzocchi
81fb2f1740 first step at scatterplot facet selector
git-svn-id: http://google-refine.googlecode.com/svn/trunk@442 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:28:06 +00:00
Stefano Mazzocchi
a286b805d5 no reason to process the contextual rows while clustering
git-svn-id: http://google-refine.googlecode.com/svn/trunk@440 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-10 08:11:31 +00:00
David Huynh
70449cf7c8 Better error catching in toNumber function.
Watch out for the string "Infinity" while importing data sets: don't parse it into a double.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@438 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 21:59:50 +00:00
David Huynh
8950e87e02 When re-loading existing projects from disk, cache recon objects by their IDs to lower memory consumption.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@437 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 19:32:50 +00:00
David Huynh
dff870519e Made numeric range index handle value.log() properly when value is 0 or negative.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@436 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 05:24:46 +00:00
David Huynh
a0d8c385f9 Do a bit more checking when retrieving project metadata just in case project metadata is null.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@435 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 04:52:32 +00:00