171 Commits

Author SHA1 Message Date
David Huynh
194fb5e706 Fixed Issue 122: Exporting to Excel on attached project raises server exception
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1370 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 03:44:30 +00:00
David Huynh
f2ce1b7161 Fixed Issue 121: Importing attached file strips backslashes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1369 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 03:35:42 +00:00
Stefano Mazzocchi
c976091624 new hooks to the Freebase Refinery
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1368 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 02:19:50 +00:00
David Huynh
823fe989a4 Fixed Issue 110: Import of single column text file with Postal Codes shows only 1 row with lots of � chars (?).
(by enforcing a confidence threshold on the encoding guessing)

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1367 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 00:26:53 +00:00
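
Note on 823fe989a4: the message describes enforcing a confidence threshold on encoding guessing. The sketch below is only an illustration of that idea and is not the code from the commit. The function name `choose_encoding`, the 0-100 confidence scale, the threshold of 50, and the UTF-8 fallback are all assumptions.

```python
from typing import Iterable, Tuple


def choose_encoding(
    guesses: Iterable[Tuple[str, int]],
    threshold: int = 50,       # assumed cut-off on a 0-100 scale
    fallback: str = "UTF-8",   # assumed default when no guess is trusted
) -> str:
    """Return the best-guess encoding only if its confidence meets the threshold.

    `guesses` holds (encoding name, confidence) pairs as a hypothetical
    detector might report them.
    """
    # Pick the highest-confidence guess, or None if there are no guesses.
    best = max(guesses, key=lambda g: g[1], default=None)
    if best is None or best[1] < threshold:
        # A low-confidence guess is ignored rather than used to decode the file.
        return fallback
    return best[0]
```

With a rule like this, a short single-column file that produces only weak guesses is decoded with the fallback instead of a wrongly guessed multi-byte encoding.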
Stefano Mazzocchi
14d046bb7a silence velocity's logs
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1366 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-28 00:01:53 +00:00
Stefano Mazzocchi
ad1db143c9 more info on how to run the tests in eclipse
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1364 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 23:45:25 +00:00
Stefano Mazzocchi
c44dd68f92 remove duplication
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1363 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 23:39:10 +00:00
Iain Sproat
c3c23a87b0 The renaming of TreeImporter to TreeImportUtilities didn't seem to get committed last time. Trying again.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1362 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:57:26 +00:00
Iain Sproat
d285999da8 New JsonImporter, JsonParser and JsonImporterTests (copy of XmlImporterTests with syntax of the example data altered for Json).
Renaming of TreeImporter to TreeImportUtilities (as per the current convention with the XmlImporter and XmlImportUtilities).

NB the new JsonParser class does not work, and 5 of the new unit tests for JsonImporter currently fail.  To be fixed in due course.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1361 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:53:17 +00:00
Stefano Mazzocchi
565e3b6e90 adding default code style info to eclipse projects
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1356 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 22:11:38 +00:00
Stefano Mazzocchi
86f810a324 hardening the timeline facet
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1353 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 21:59:17 +00:00
Iain Sproat
e5ddfa6fdc All methods in XmlImportUtilities now use the TreeParser interface.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1323 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:59:53 +00:00
Iain Sproat
d71c563831 XmlImportUtilities.detectPathFromTag and XmlImportUtilities.detectRecordElement methods now use a generic TreeParser interface. A lightweight wrapper XmlParser wraps XMLStreamReader to provide parsing for xml data.
This is another small step towards a generic importer for tree structured data.  My plan is to refactor more of XmlImportUtilities' methods to use the TreeParser interface so that XmlStreamReader is no longer called directly from XmlImportUtilities.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1322 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:40:51 +00:00
Iain Sproat
1bda46d40f Methods which are generic to any tree structured data and don't rely on an XmlParser have been moved to a new TreeImporter class. This is a small step towards supporting importers for other tree structured data.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1321 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 16:09:44 +00:00
Iain Sproat
2df0d1ac3a Eclipse .launch file for tests now correctly points to project 'grefine' rather than incorrect 'refine'.
XmlImport tests now pass - the tests themselves were not testing the current xml being input to them.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1320 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 14:47:06 +00:00
Stefano Mazzocchi
6273332cef the sandbox->freebase loading conduit is now named "refinery"
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1313 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-25 01:57:56 +00:00
David Huynh
8b115f67ff Stray renaming
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1301 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:42:08 +00:00
David Huynh
a112ffa9ab Caught a stray rename miss. Added more generic support for renaming old Java classes so that extensions could remain backward-compatible, too
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1297 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:59:57 +00:00
David Huynh
d15fe661fa Renamed in client-side code. Refine should start and existing projects should still work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1293 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 22:55:28 +00:00
David Huynh
1367ce301e More renaming, except for: client-side code, build scripts, anything to do with data loading and QA, workspace path. Refine can still run, and undo/redo on existing projects is working.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1290 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:36:33 +00:00
David Huynh
e6bc603a11 Renamed Java classes whose names contain 'Gridworks'. Refine is still able to start. But don't check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1289 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:46:39 +00:00
David Huynh
edb23eb263 Changed Java packages com.google.gridworks.* to com.google.refine.* and modified other code just enough to start grefine up without error. Much remains to be done. Do not check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1288 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:04:10 +00:00
David Huynh
a40a09b391 Renamed sub-projects as they are known to Eclipse
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1287 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 16:40:46 +00:00
David Huynh
362a277c58 Added main menu command to open system file explorer at the workspace directory.
Made project manager more careful at disposing projects, in case any of them is null.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1272 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 06:52:10 +00:00
David Huynh
2609c4049d Fixed issue 114: "Refactor project manager api to allow importers to create project metadata" by incorporating tfmorris' patch.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1271 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 01:00:23 +00:00
David Huynh
8d1f2d44b9 Patched the json lib to allow up to 100 levels of nesting.
Fixed ImportProjectCommand to redirect from the error page back to /index rather than /index.html.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1270 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 00:21:54 +00:00
Stefano Mazzocchi
eee4514643 fixing Issue-125
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1269 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 23:05:53 +00:00
Stefano Mazzocchi
df0a30e22d this is really a debug log
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1262 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 21:52:02 +00:00
David Huynh
9acd3dbe05 Fixed issue 127 - Add column from Freebase raises exception. Made sure DataExtensionChange saves properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1261 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 04:53:37 +00:00
Stefano Mazzocchi
4af3a4cbf5 allow gridworks to load extensions also from the same place where data is stored (in the 'extension' subfolder)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1258 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-02 01:17:54 +00:00
Stefano Mazzocchi
e973fd3e89 d'oh, wrong object counter (thanks again to knut.forkalsrud for spotting my mistakes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1250 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:25:16 +00:00
Stefano Mazzocchi
3bb0d3918c adding the latest butterfly with the ability to avoid loading specific extensions (selected with strings or regexps on their names) via a property
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1247 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:08:47 +00:00
Stefano Mazzocchi
df19fc2065 Fixed ISSUE-29
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1245 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:47:01 +00:00
Stefano Mazzocchi
e5c6dda178 Fixed Issue-116
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1243 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:33:05 +00:00
Stefano Mazzocchi
7df259008b more whitespace
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1242 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:59 +00:00
Stefano Mazzocchi
cf66d00854 only whitespace (no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1240 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:48 +00:00
Stefano Mazzocchi
860d6c4ee2 a little more solid (it's possible to have both Dates and Calendars in there)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1239 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:12:11 +00:00
Stefano Mazzocchi
3648883e0c ISSUE-99 thanks to knut.forkalsrud for providing the patch!
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1238 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 18:56:35 +00:00
Stefano Mazzocchi
5d788c9260 added timeline facet (like the numeric binning facet but working on dates instead of numbers and with date-specific binning logic)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1234 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 17:57:54 +00:00
Stefano Mazzocchi
37099b4c9f new butterfly version: mount extensions in their own subdirectory in the URL space
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1233 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 01:14:47 +00:00
David Huynh
e809642464 Fixed link to index
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1230 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-27 14:59:01 +00:00
David Huynh
bd7453adba Made sure to strip off charset from content-type when importing from URLs before looking up for the right importer.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1229 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-25 17:35:16 +00:00
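
Note on bd7453adba: the message describes stripping the charset parameter from a Content-Type value before looking up an importer. The sketch below illustrates that step only and is not the project's code. The `IMPORTERS` table, the function names, and the default importer are hypothetical.

```python
# Hypothetical mapping from bare media type to an importer name.
IMPORTERS = {
    "text/xml": "xml",
    "application/json": "json",
    "text/tab-separated-values": "tsv",
}


def media_type(content_type: str) -> str:
    """Drop parameters such as '; charset=UTF-8' and normalise case."""
    return content_type.split(";", 1)[0].strip().lower()


def pick_importer(content_type: str, default: str = "text") -> str:
    """Look up the importer by bare media type, falling back to a default."""
    return IMPORTERS.get(media_type(content_type), default)
```

Without the stripping step, a header value such as `text/xml; charset=UTF-8` would not match the `text/xml` key and the lookup would fall through to the default.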
David Huynh
367796488e Fixed xml importer: subgroups should now line up properly by rows.
Added command to reorder columns using drag and drop.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1227 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-25 06:17:08 +00:00
David Huynh
2ca12f5771 Switched to version 2 of standard-reconcile service, which returns MIDs instead of IDs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1222 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-24 18:46:26 +00:00
David Huynh
276fae8938 Save templating exporter's template.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1221 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-24 06:36:49 +00:00
David Huynh
baa4e0db8c Added command to browse to the data load page on the Gridworks QA dashboard.
Save the data load job name and fill it in the next time the Load into Freebase dialog is opened.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1220 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-24 06:06:07 +00:00
David Huynh
e4af19f8a6 Namespaced operations' names by their modules' names.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1215 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-24 04:02:36 +00:00
David Huynh
d61522eb64 Started to make index page extensible.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1214 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-24 00:52:19 +00:00
David Huynh
1f69fba43c Added command Add Column by Fetching URLs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1203 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-22 23:55:07 +00:00
David Huynh
9041ebf7b9 Bumped version to 1.5.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1195 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-22 06:42:21 +00:00