Commit Graph

1021 Commits

Author SHA1 Message Date
Iain Sproat
e5ddfa6fdc All methods in XmlImportUtilities now use the TreeParser interface.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1323 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:59:53 +00:00
Iain Sproat
d71c563831 XmlImportUtilities.detectPathFromTag and XmlImportUtilities.detectRecordElement methods now use a generic TreeParser interface. A lightweight wrapper XmlParser wraps XMLStreamReader to provide parsing for xml data.
This is another small step towards a generic importer for tree structured data. My plan is to refactor more of XmlImportUtilities' methods to use the TreeParser interface so that XMLStreamReader is no longer called directly from XmlImportUtilities.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1322 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 17:40:51 +00:00
Iain Sproat
1bda46d40f Methods which are generic to any tree structured data and don't rely on an XmlParser have been moved to a new TreeImporter class. This is a small step towards supporting importers for other tree structured data.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1321 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 16:09:44 +00:00
Iain Sproat
2df0d1ac3a Eclipse .launch file for tests now correctly points to project 'grefine' rather than incorrect 'refine'.
XmlImport tests now pass - the tests themselves were not testing the current xml being input to them.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1320 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-27 14:47:06 +00:00
Stefano Mazzocchi
6273332cef the sandbox->freebase loading conduit is now named "refinery"
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1313 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-25 01:57:56 +00:00
David Huynh
289d4e4c57 Ignore classes/ sub-dirs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1303 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:47:29 +00:00
David Huynh
64727ebd03 Somehow this fell out of a previous checkin.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1302 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:45:18 +00:00
David Huynh
8b115f67ff Stray renaming
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1301 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-23 00:42:08 +00:00
David Huynh
a112ffa9ab Caught a stray rename miss. Added more generic support for renaming old Java classes so that extensions could remain backward-compatible, too
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1297 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:59:57 +00:00
David Huynh
289f46bbf5 Removed yet another old empty Java package dir.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1296 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:36:34 +00:00
David Huynh
6ee94132f9 Removed another old empty Java package dir.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1295 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:34:03 +00:00
David Huynh
b45506b787 If refine data dir doesn't exist, try to find and move gridworks data dir over
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1294 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 23:30:14 +00:00
David Huynh
d15fe661fa Renamed in client-side code. Refine should start and existing projects should still work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1293 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 22:55:28 +00:00
David Huynh
22cbcddd78 Renamed in build scripts. ./refine build, ./refine, and ./refine dist all work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1292 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 21:47:35 +00:00
David Huynh
fe855c3aae Removed old empty Java packages
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1291 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:49:08 +00:00
David Huynh
1367ce301e More renaming, except for: client-side code, build scripts, anything to do with data loading and QA, workspace path. Refine can still run, and undo/redo on existing projects is working.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1290 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 18:36:33 +00:00
David Huynh
e6bc603a11 Renamed Java classes whose names contain 'Gridworks'. Refine is still able to start. But don't check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1289 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:46:39 +00:00
David Huynh
edb23eb263 Changed Java packages com.google.gridworks.* to com.google.refine.* and modified other code just enough to start grefine up without error. Much remains to be done. Do not check out the code just yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1288 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 17:04:10 +00:00
David Huynh
a40a09b391 Renamed sub-projects as they are known to Eclipse
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1287 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 16:40:46 +00:00
David Huynh
65dbb5a2dc Removed RDF Exporter extension as its authors have agreed to host it themselves. For more information on that extension, see http://lab.linkeddata.deri.ie/2010/dcat/
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1286 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-22 15:49:45 +00:00
Stefano Mazzocchi
871bd356f6 swapping extensions
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1285 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-21 17:43:50 +00:00
Stefano Mazzocchi
06e83a66ff Issue-134: make sure the {build.dir} directory actually exists (it normally gets created by the invoking ./gridworks shell file, but only when calling a build, not a dist, so act more defensively here)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1275 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 17:08:39 +00:00
Stefano Mazzocchi
0a0add9e8e Issue-135: d'oh, stupid mistake
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1274 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-19 16:44:04 +00:00
David Huynh
362a277c58 Added main menu command to open system file explorer at the workspace directory.
Made project manager more careful at disposing projects, in case any of them is null.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1272 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 06:52:10 +00:00
David Huynh
2609c4049d Fixed issue 114: "Refactor project manager api to allow importers to create project metadata" by incorporating tfmorris' patch.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1271 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 01:00:23 +00:00
David Huynh
8d1f2d44b9 Patched the json lib to allow up to 100 levels of nesting.
Fixed ImportProjectCommand to redirect from the error page back to /index rather than /index.html.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1270 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-17 00:21:54 +00:00
Stefano Mazzocchi
eee4514643 fixing Issue-125
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1269 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 23:05:53 +00:00
Stefano Mazzocchi
9c5eda343a last of the bashisms (hopefully)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1268 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 22:00:54 +00:00
Stefano Mazzocchi
3667806e3a does not belong in version control
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1267 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:59:31 +00:00
Stefano Mazzocchi
81190b1e50 fixing bashisms
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1266 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 21:53:38 +00:00
Stefano Mazzocchi
221504e2da - removed all bash-isms (hopefully)
- hardened FULL_VERSION calculation (used only in the win32 exe, btw)
- made windows_dist run on all operating systems, not just mac

git-svn-id: http://google-refine.googlecode.com/svn/trunk@1265 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-16 18:16:46 +00:00
Stefano Mazzocchi
df0a30e22d this is really a debug log
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1262 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 21:52:02 +00:00
David Huynh
9acd3dbe05 Fixed issue 127 - Add column from Freebase raises exception. Made sure DataExtensionChange saves properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1261 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-10 04:53:37 +00:00
Stefano Mazzocchi
e6d293bcc8 make it compile again
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1260 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-05 06:07:32 +00:00
Stefano Mazzocchi
4af3a4cbf5 allow gridworks to load extensions also from the same place where data is stored (in the 'extension' subfolder)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1258 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-02 01:17:54 +00:00
Stefano Mazzocchi
089105a997 turns out people are having problems with this in Ubuntu... so let's avoid it for now
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1257 7d457c2a-affb-35e4-300a-418c747d4874
2010-09-01 00:23:56 +00:00
Stefano Mazzocchi
e973fd3e89 d'oh, wrong object counter (thanks again to knut.forkalsrud for spotting my mistakes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1250 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:25:16 +00:00
Stefano Mazzocchi
3bb0d3918c adding the latest butterfly with the ability to avoid loading specific extensions (selected with strings or regexps on their names) via a property
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1247 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 23:08:47 +00:00
Stefano Mazzocchi
86ee119c44 more changes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1246 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:47:06 +00:00
Stefano Mazzocchi
df19fc2065 Fixed ISSUE-29
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1245 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:47:01 +00:00
Stefano Mazzocchi
21ac857a50 more changes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1244 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:33:10 +00:00
Stefano Mazzocchi
e5c6dda178 Fixed Issue-116
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1243 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:33:05 +00:00
Stefano Mazzocchi
7df259008b more whitespace
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1242 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:59 +00:00
Stefano Mazzocchi
900b173d99 keep track of changes
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1241 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:54 +00:00
Stefano Mazzocchi
cf66d00854 only whitespace (no functional changes)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1240 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:32:48 +00:00
Stefano Mazzocchi
860d6c4ee2 a little more solid (it's possible to have both Dates and Calendars in there)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1239 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 19:12:11 +00:00
Stefano Mazzocchi
3648883e0c ISSUE-99 thanks to knut.forkalsrud for providing the patch!
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1238 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 18:56:35 +00:00
Stefano Mazzocchi
8bff031c1c increasing permgen and enabling garbage collection of classes in debug mode
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1235 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 17:58:15 +00:00
Stefano Mazzocchi
5d788c9260 added timeline facet (like the numeric binning facet but working on dates instead of numbers and with date-specific binning logic)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1234 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 17:57:54 +00:00
Stefano Mazzocchi
37099b4c9f new butterfly version: mount extensions in their own subdirectory in the URL space
git-svn-id: http://google-refine.googlecode.com/svn/trunk@1233 7d457c2a-affb-35e4-300a-418c747d4874
2010-08-31 01:14:47 +00:00