Commit Graph

72 Commits

Author SHA1 Message Date
Stefano Mazzocchi
fcc54e2ab3 removing what turned out to be dead code
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1162 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-16 16:09:52 +00:00
Stefano Mazzocchi
bb7d3c388c ISSUE-115 datePart('month') should return January as 1 not 0
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1161 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-16 16:09:39 +00:00
David Huynh
a90a9c724e Forgot to register blank down operation in operation registry previously.
Added uniques GEL function for eliminating duplicates in an array.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1158 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-16 03:00:43 +00:00
David Huynh
fa816007a7 Fixed copy-and-paste string mistake in BlankDownOperation.
Fixed minor bug in Row.isValueBlank that returns true for non-string values.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1157 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-16 02:16:41 +00:00
David Huynh
e61655506a Added new command to import QA results, so any reconciliation action that yields conflicting or uncertain opinions among reviewers can be examined inside Gridworks.
Added new customized facets for checking QA results. 

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1156 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-13 16:26:33 +00:00
David Huynh
8f071ede31 Added command Transpose Cells in Rows into Columns (Issue 82).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1147 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-08 06:30:30 +00:00
David Huynh
d1a66e2e63 Added JSON support in GEL.
Added GEL functions: escape, parseJson, hasField.
Fixed bug in preference store: expression history was still not loaded properly.
Integers are now rendered without decimals in the expression preview dialogs.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1145 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-07 22:57:48 +00:00
David Huynh
e70f16025b Fixed bug introduced recently by changing the preference key of the expression history from "expressions" to "scripting.expressions".
Added code in FileProjectManager for trying to recover projects in the workspace dir but are not recorded in the workspace json file.


git-svn-id: http://google-refine.googlecode.com/svn/trunk@1144 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-07 20:25:31 +00:00
David Huynh
0500d7aa10 Added commands Move Column to Beginning, Move Column to End, Move Column Left, Move Column Right.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1142 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-07 01:24:48 +00:00
David Huynh
f0eae04c0c Forgot to add 2 files in the last commit
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1141 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-06 23:50:22 +00:00
David Huynh
a8ee9b9e08 Added Fill Down and Blank Down commands.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1140 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-06 20:33:28 +00:00
David Huynh
3bda9d035d Added support for creating a project by pointing to a data file URL.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1139 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-06 06:15:05 +00:00
David Huynh
f411dc9104 - Issue 112: Refactor Importer API (patch from tfmorris)
- Added support for storing custom metadata in ProjectMetadata.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1138 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-06 05:04:25 +00:00
David Huynh
00c6865d95 - Select All and Unselect All buttons in History Extract dialog
- Schema skeleton: support for multiple cells per cell-as nodes, and for conditional links


git-svn-id: http://google-refine.googlecode.com/svn/trunk@1137 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-05 20:27:39 +00:00
David Huynh
5cb3f924f6 Added support in protograph for specifying several column names per cell-as nodes.
Started to add support for conditional links in protograph. The UI is not hooked up with.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1136 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-05 08:29:34 +00:00
David Huynh
b8ad56c6db Made sure in the schema skeleton dialog, in the dialog box for a node, in the "cell-as-topic" section, the type is always recorded.
In the triple loader transposed node factory, use the column's recon config to generate new topics' type.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1135 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-05 00:53:08 +00:00
David Huynh
dcc3ac8534 Renamed packages com.metaweb.* to com.google.*.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1130 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-03 23:01:18 +00:00
Stefano Mazzocchi
8c56b437fa more fixes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1129 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-03 20:19:48 +00:00
David Huynh
762a9f13eb Text facet's choice count limit is now configurable through preference page. Preference page needs polishing.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1127 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-02 01:49:10 +00:00
David Huynh
965ef20790 Made sure commands that create new columns check for duplicate column names.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1126 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-01 04:44:21 +00:00
David Huynh
4ad31ffcde Excel importer now supports "header lines" parameter.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1125 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-01 04:22:45 +00:00
David Huynh
7bb6674e5b Fixed recently introduced bug: expressions were not logged because preference stores were not initialized properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1124 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-31 07:01:44 +00:00
David Huynh
f069780bfa Added support for bundling .js files to shave off some loading time.
For GetRowsCommand, tried to use jsonp but that didn't seem to improve performance much.
Gzip http responses of various text-based mime types.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1122 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-31 06:51:11 +00:00
David Huynh
d71d84194f Register new operation Transpose Cells in Columns into Rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1112 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-30 02:28:33 +00:00
David Huynh
ee14955605 Added new command Transpose Cells in Columns into Rows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1111 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-30 02:25:58 +00:00
David Huynh
a192674118 - added smartSplit GEL function that can handle quoted values
- added max width to operation extract dialog
- made GEL get and slice functions handle HasFieldsList
- fixed versioned standard-reconcile URLs (they need userid.user.dev)

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1110 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-26 22:50:00 +00:00
David Huynh
2ff0184c65 - switched to accessing versioned standard-reconcile app
- standardized preference keys to using dot separated format
- added support to override freeq url from workspace preferences
- added GEL controls: forEachIndex, forRange, filter
- enforced max-width on preview table columns in expression preview dialog
- added preservedAllTokens option to split GEL function

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1109 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-26 21:12:40 +00:00
David Huynh
4522b98f32 Store and use job ID to retrieve MDO ID and send that in subsequent loads.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1100 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-16 17:32:06 +00:00
David Huynh
4373e7276f Pass target Freebase type IDs in recon objects to freeq.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1099 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-15 21:45:17 +00:00
David Huynh
b854f99ef5 Removed extra closing brace.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1096 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-11 05:54:06 +00:00
David Huynh
43dadf40da Added ignore:true to any triple that shouldn't be loaded.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1095 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-10 05:07:24 +00:00
David Huynh
513283d4d1 Support creation of cache directories, so the rdf importer can store its lucene indexes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1090 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-08 23:22:29 +00:00
David Huynh
f5fc44e24e Refactoring to expose extension points that the rdf-exporter extension will plug into.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1074 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-06 00:14:07 +00:00
David Huynh
ab82562016 Tripleloader protograph transposer now generates more context information for QA.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1073 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-03 01:39:14 +00:00
David Huynh
217fb7b25c Fixed Issue 66: Records not excluded with inverted text facet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1064 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-01 20:26:54 +00:00
Stefano Mazzocchi
a682d6b36f fixing eclipse warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1063 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-01 08:03:42 +00:00
Stefano Mazzocchi
9fbff0640b make sure that splitting values maintains empty cells if the separator is repeated
(this is useful in case the cells contains a rigid structure across multiple columns)

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1062 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-01 07:47:57 +00:00
Stefano Mazzocchi
2302d017d8 remove eclipse warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1061 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-01 07:47:52 +00:00
David Huynh
18b720b913 Fixed CSV and TSV export bug.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1059 7d457c2a-affb-35e4-300a-418c747d4874
2010-07-01 02:32:03 +00:00
David Huynh
2e3984d54a When transposing data to triple loader output, pass row indices and cell indices deep down so later we can generate more context information for recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1051 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-29 17:30:16 +00:00
David Huynh
0e4781cb58 Forgot a console.log() call.
Allow reconciling against no particular type.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1043 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-27 04:20:35 +00:00
David Huynh
76c8cd77eb "search for match" links in data table cells now use recon service's entity suggest options.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1041 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-27 03:31:56 +00:00
David Huynh
ecfb893e98 More work on the recon UI. Standard services can now be added.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1038 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-26 01:10:23 +00:00
David Huynh
1342ceacea Careful not to load all projects in an autosave cycle.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1037 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-25 22:43:30 +00:00
David Huynh
058e86b4c8 First pass in trying to generalize standard reconciliation service UI. A lot of pieces are still Freebase-centric.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1032 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-24 20:13:51 +00:00
Iain Sproat
f0ed50e468 issue 69 fixed. ControlFunctionRegistry now correctly registers Chomp expression as "chomp" key.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1024 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-23 17:53:29 +00:00
David Huynh
a9f77d0f51 Minor bug.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1020 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-22 21:29:15 +00:00
Iain Sproat
0d7b3b0e9c ProjectManager is now partially unit tested.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1015 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-21 19:57:31 +00:00
Iain Sproat
dcf6919900 Functionality which didn't need to be moved to FileProjectManager as it wasn't file system specific has been moved back to ProjectManager. importProject function is now named loadProjectMetadata to avoid confusion.
Some additional source code documentation added to ProjectManager, and methods rearranged in more readable fashion.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1011 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-20 16:32:49 +00:00
Iain Sproat
7ced0cb31e New feature for importing text files (CSV and TSV). By selecting the checkbox in index.html it allows the effects of quotation marks around data values to be ignored.
Unit test added for this.

This has required a further branch to opencsv - patch sent to opencsv project and can be tracked at  https://sourceforge.net/tracker/?func=detail&aid=3018599&group_id=148905&atid=773543

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1010 7d457c2a-affb-35e4-300a-418c747d4874
2010-06-20 14:47:45 +00:00