Stefano Mazzocchi
1b9cfbbf90
detabbing (no functional changes)
...
David, you might want to check your editor settings; you're mixing tabs with spaces
git-svn-id: http://google-refine.googlecode.com/svn/trunk@724 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 09:02:41 +00:00
Stefano Mazzocchi
11da70d223
Applying patch for Issue 21 from iainsproat
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@722 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 08:59:05 +00:00
David Huynh
8412aa72dd
Fixed Issue 17: Conflated triples - all rows are producing triple with "s" :" $Name_0".
...
Also exposed "id" field for recon objects.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@720 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 07:45:22 +00:00
David Huynh
992cb964eb
Fixed Issue 2: Undo History bug. The MassChange class didn't write out each elemental change's class and so they couldn't be read back in.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@719 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-12 06:35:01 +00:00
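
The Undo History fix above (Issue 2) comes down to storing each elemental change's class next to its data, so a loader knows which type to rebuild. A minimal sketch of that idea, in Python rather than the project's Java. The classes `CellChange` and `ColumnRenameChange`, the registry `CHANGE_CLASSES`, and the JSON layout are all hypothetical illustrations, not the project's actual code or file format:

```python
import json


class CellChange:
    """Hypothetical elemental change: one cell's value was replaced."""

    def __init__(self, row, old, new):
        self.row, self.old, self.new = row, old, new

    def to_dict(self):
        return {"row": self.row, "old": self.old, "new": self.new}

    @classmethod
    def from_dict(cls, data):
        return cls(data["row"], data["old"], data["new"])


class ColumnRenameChange:
    """Hypothetical elemental change: a column was renamed."""

    def __init__(self, old_name, new_name):
        self.old_name, self.new_name = old_name, new_name

    def to_dict(self):
        return {"old_name": self.old_name, "new_name": self.new_name}

    @classmethod
    def from_dict(cls, data):
        return cls(data["old_name"], data["new_name"])


# Registry mapping the stored class name back to the class (an assumption of this sketch).
CHANGE_CLASSES = {cls.__name__: cls for cls in (CellChange, ColumnRenameChange)}


def save_mass_change(changes):
    # Write the class name alongside each change; without it the loader
    # cannot tell which kind of change an entry represents.
    return json.dumps(
        [{"class": type(c).__name__, "data": c.to_dict()} for c in changes]
    )


def load_mass_change(text):
    # Look up each entry's class by name and rebuild the change from its data.
    return [
        CHANGE_CLASSES[entry["class"]].from_dict(entry["data"])
        for entry in json.loads(text)
    ]
```

Round-tripping a mixed list through `save_mass_change` and `load_mass_change` should return objects of the same classes in the same order.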
Stefano Mazzocchi
fc5dd69c33
wrong fix
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@602 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-05 01:50:54 +00:00
Stefano Mazzocchi
c32899aea6
clearing PMD warnings
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@600 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-05 01:42:08 +00:00
Stefano Mazzocchi
92ecc0c0f5
detab + dedos for java files (no functional changes)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@594 7d457c2a-affb-35e4-300a-418c747d4874
2010-05-04 23:24:48 +00:00
David Huynh
80a3e62d3e
Display progress percentage in window title while some long running process is occurring.
...
Expose recon candidates to expressions.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@579 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 22:49:59 +00:00
David Huynh
bab1e8905b
Jacked up jetty form upload size limit.
...
Added a few more array bound checks.
Reduced number of recon candidate and recon objects created by extend data operations.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@577 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-30 19:41:53 +00:00
David Huynh
89e1d8b5ac
Got history entries' IDs into Recon objects so we can track from a Recon object to all others created by the same operation.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@562 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-28 17:18:44 +00:00
David Huynh
4f1aaff486
Support new fields for Recon objects.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@561 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 23:21:21 +00:00
David Huynh
15c188ad7a
Added more metadata into recon objects.
...
Tried to minimize number of unique recon objects created when calling Recon.dup().
git-svn-id: http://google-refine.googlecode.com/svn/trunk@560 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 23:17:18 +00:00
David Huynh
e77b99e58b
For relevance service, auto-match only if the type matches, the score is at least 100, and, when there is more than one result, the ratio of the first result's score to the second result's score is at least 1.5.
...
For recon service, auto-match only if the result has match:true.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@559 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 21:38:47 +00:00
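
The auto-match rules in the commit above can be sketched as follows, in Python rather than the project's Java. The function names, the dictionary keys `types`, `score`, and `match`, and the assumption that results arrive sorted by descending score are illustrative guesses, not the services' confirmed response format:

```python
def should_auto_match_relevance(results, expected_type, min_score=100.0, min_ratio=1.5):
    """Apply the relevance-service rule to a list of results.

    `results` is assumed to be sorted by descending score, each item a dict
    with hypothetical keys "types" (list of type ids) and "score" (number).
    """
    if not results:
        return False
    best = results[0]
    # Rule 1: the type must match.
    if expected_type not in best.get("types", []):
        return False
    # Rule 2: the score must be at least 100.
    if best["score"] < min_score:
        return False
    # Rule 3: with more than one result, first/second score ratio must be >= 1.5.
    if len(results) > 1:
        second_score = results[1]["score"]
        # A non-positive runner-up score is treated as no contest (sketch assumption).
        if second_score > 0 and best["score"] / second_score < min_ratio:
            return False
    return True


def should_auto_match_recon(result):
    """Apply the recon-service rule: the result must carry match: true."""
    return result.get("match") is True
```

For example, scores of 150 and 120 give a ratio of 1.25, so the top result is not auto-matched even though it clears the score threshold.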
David Huynh
ca2bc0a304
Fixed null pointer exception problem in HeuristicReconConfig when trying to use "recon" service.
...
Made custom suggest widget rely on gridworks-helper acre app for fetching property suggestions.
Made the various property suggest fields in the recon dialog use our custom suggest widget.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@557 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-27 20:34:36 +00:00
David Huynh
3ae72ea630
Minor bug.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@548 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 01:15:02 +00:00
David Huynh
fed3c87fa6
Added row flagging support. Fixed bug in row star change: starring or unstarring one row wasn't undo-able previously.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@547 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-26 01:08:56 +00:00
David Huynh
0778b324de
Made facets' expressions editable.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@527 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-24 01:19:13 +00:00
David Huynh
f9a829758e
Pool recons and recon candidates. This yields smaller project files, change files, and AJAX responses for get-rows. It should make re-loading existing projects faster.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@521 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-23 19:39:12 +00:00
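
The pooling described in the commit above can be illustrated with a short sketch: each recon object is written once into a shared pool, and cells refer to it by ID. This is Python rather than the project's Java, and the JSON layout (`pool`, `recons`, `rows`, `v`, `r`) is a hypothetical format chosen for the example, not the project's actual file format:

```python
import json


def serialize_rows(rows):
    """Serialize rows of cells, writing each recon object once into a pool.

    A cell is a dict with "v" (value) and an optional "recon" dict that has
    an "id" key. These key names are assumptions of this sketch.
    """
    pool = {}
    out_rows = []
    for row in rows:
        out_row = []
        for cell in row:
            recon = cell.get("recon")
            if recon is None:
                out_row.append({"v": cell["v"]})
            else:
                key = str(recon["id"])
                pool.setdefault(key, recon)  # store each recon only once
                out_row.append({"v": cell["v"], "r": key})  # cell keeps just the ID
        out_rows.append(out_row)
    return json.dumps({"pool": {"recons": pool}, "rows": out_rows})


def deserialize_rows(text):
    """Rebuild rows so that cells sharing a recon ID share one object."""
    data = json.loads(text)
    recons = data["pool"]["recons"]
    return [
        [
            {"v": c["v"], "recon": recons[c["r"]]} if "r" in c else {"v": c["v"]}
            for c in row
        ]
        for row in data["rows"]
    ]
```

Because cells that share a recon ID resolve to the same pooled object on load, both the serialized text and the in-memory structure avoid duplicate copies.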
Stefano Mazzocchi
6990604981
implemented the full gridworks -> freebase conduit via delegated oauth and freeq/tripleloader
...
(still doesn't work as argus returns a 500 but the entire conduit is in place)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@519 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-23 08:25:52 +00:00
David Huynh
5ba67b7b26
Implemented column split command. It seems to be working in "by lengths" mode.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@510 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-20 23:08:14 +00:00
David Huynh
72f1f0956e
More polishing on the facet panel.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@498 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-18 00:56:09 +00:00
David Huynh
1d938bc4d0
Better MQL batching during extending data operations.
...
Tried to use JSON streaming in changes as well.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@479 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-15 00:09:14 +00:00
David Huynh
24a7ea91b6
Fixed bugs
...
- MassEditOperation was barfing when engineConfig was missing
- When parsing JSON in streaming mode, get long instead of int and double instead of float so that we won't get overflow exception.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@476 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 20:34:29 +00:00
Stefano Mazzocchi
3bae823010
fixed Eclipse warning (no functional change)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@473 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-14 02:39:24 +00:00
David Huynh
4a06c49a9a
Added streaming json parser for faster re-loading of existing projects.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@470 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-13 23:57:03 +00:00
David Huynh
8950e87e02
When re-loading existing projects from disk, cache recon objects by their IDs to lower memory consumption.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@437 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 19:32:50 +00:00
Stefano Mazzocchi
d3d40d608a
bunch of PMD-induced fixes
...
(now the PMD report is clean)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@430 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-09 00:14:11 +00:00
Stefano Mazzocchi
7526c4e582
cleanups (no functional changes)
...
this makes pmd and javac on linux happier
git-svn-id: http://google-refine.googlecode.com/svn/trunk@427 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-08 20:46:02 +00:00
David Huynh
9d9329ca96
Implemented row remove command.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@391 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:47:44 +00:00
David Huynh
1fd85c62bf
Implemented column rename command.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@390 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 07:15:34 +00:00
David Huynh
f402db10af
Implemented inter-project joins.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@387 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-06 05:35:48 +00:00
Stefano Mazzocchi
2efbf0031f
- removed the 'thirdparty' directory (now the 'gridworks' script will download and install needed tools if they are not present in the system already)
...
- added 'findbugs' command that uses the findbugs static analyzer to look for problems in the code
- fixed a bunch of issues that findbugs found (a few methods would go a little faster, and a few NPEs will be avoided... nothing major but good to have)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@382 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-05 07:15:16 +00:00
Stefano Mazzocchi
4eda7ae2c0
avoid an array out of bounds exception in case there are no columns in the dataset
...
(I know, it should not happen but when it does let's not barf)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@375 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 20:21:41 +00:00
Stefano Mazzocchi
521acda025
- pass the svn revision as format version (for more detailed verification)
...
- add an 'autoreload' setting that makes Gridworks autoreload itself if a class gets changed
(this is useful to make development cycles faster when working on the java code with an autocompiling IDE like Eclipse or IDEA)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@372 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 00:52:38 +00:00
David Huynh
1d0e6abaf8
Got some work done on the plane:
...
- better detection of record XML elements in XML importer
- XML importer creates column groups and data table view renders them
git-svn-id: http://google-refine.googlecode.com/svn/trunk@356 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-27 05:23:09 +00:00
David Huynh
4df1c4107a
Fixed a bug introduced recently: recon candidates were not serializing their topic types for the data view, so in the data view we can't send back a candidate's types when the user wants to match the candidate to some cells. I need to figure out a better way to optimize this.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@350 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 03:58:52 +00:00
David Huynh
85d1671d6e
Fixed minor bug: recon wasn't saving out its candidates if its judgment was Matched. So when a project was saved and reloaded, it lost all of the recon candidates except for the matches.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@344 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 18:02:08 +00:00
David Huynh
2846d66261
Detect max cell index on load, just in case the max cell index we've stored previously was out of whack.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@341 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 03:19:17 +00:00
David Huynh
f8d30e9e8e
Don't send back recon candidate types for rendering cells.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@340 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 01:17:45 +00:00
David Huynh
a43b2a72c1
Made various GEL functions and the forEach control work with java.util.List and java.util.Collection in addition to just Object[].
...
Added field columnNames to row object.
Added 1-bounded numeric log facet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@328 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 23:04:17 +00:00
David Huynh
cd062cf028
Minor bug: recon candidate's "id" field should return id, not name.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@312 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-17 19:54:27 +00:00
David Huynh
084a6114d7
Track freebase types of columns added with data from Freebase, so that we can later add more data based on those columns. Fixed minor bug in serialization of data extension records.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@303 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 06:18:00 +00:00
David Huynh
c6e7986206
Extend data operation is working.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@301 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-16 00:24:20 +00:00
David Huynh
025eccce4b
Implemented "record" field for each row.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@283 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 06:33:03 +00:00
David Huynh
af3cb76056
Added support for including dependent rows in row visiting. Facets still don't count them, though.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@282 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-12 01:06:23 +00:00
David Huynh
e760750b57
Fixed minor bug that prevented column details from getting passed on to recon service.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@280 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-11 21:55:32 +00:00
David Huynh
b1fca11342
Made recon use cells from context rows.
...
Fixed bug in menu left-right positioning.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@271 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 08:32:20 +00:00
David Huynh
6bf5418f9d
Cell changes should also flush column precomputes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@267 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 07:42:57 +00:00
David Huynh
e008332399
- make recon changes flush column precomputes
...
- fixed bug where recon features are not saved to file properly
- support selecting non-numeric, blank, and error choices in numeric range facets
git-svn-id: http://google-refine.googlecode.com/svn/trunk@265 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-10 06:02:36 +00:00
David Huynh
5d3a57eeeb
Implemented project import and export commands (from/to .tar files).
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@234 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-08 02:34:25 +00:00