Commit Graph

3423 Commits

Author SHA1 Message Date
Stefano Mazzocchi
d312a75b2f don't need this
git-svn-id: http://google-refine.googlecode.com/svn/trunk@378 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-03 01:01:43 +00:00
Stefano Mazzocchi
c24ec94835 had to shuffle around a bunch of classes to separate the main server classloader from the context classloader and allow reloading to happen for real
git-svn-id: http://google-refine.googlecode.com/svn/trunk@377 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-03 01:01:11 +00:00
Stefano Mazzocchi
72203cd3d5 - moved all code that contained MIT IP outside (http://code.google.com/p/simile-vicino/)
- moved bzip2 and tar code from apache ant into their own jar files
- now gridworks source contains only com.metaweb.* code everything else is a jar dependency
- started to work on archive importer


git-svn-id: http://google-refine.googlecode.com/svn/trunk@376 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 23:40:12 +00:00
Stefano Mazzocchi
4eda7ae2c0 avoid an array out of bounds exception in case there are no columns in the dataset
(I know, it should not happen but when it does let's not barf)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@375 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 20:21:41 +00:00
Stefano Mazzocchi
62f5f21ca3 atom is handled as well by the XML importer
git-svn-id: http://google-refine.googlecode.com/svn/trunk@374 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:44:05 +00:00
Stefano Mazzocchi
83faee3aa9 add a frame-less menu item in macosx to be able to open another gridworks browser window/tab in case we closed it by mistake
(no idea how to do this on windows, though, since there is no frame-less menu concept there)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@373 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 06:39:41 +00:00
Stefano Mazzocchi
521acda025 - pass the svn revision as format version (for more detailed verification)
- add an 'autoreload' setting that makes Gridworks autoreload its self if a class gets changed
(this is useful to make development cycles faster when working on the java code with autocompiling IDE like Eclipse or IDEA)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@372 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-02 00:52:38 +00:00
Stefano Mazzocchi
d1e72e7797 make the undo dialog closable
git-svn-id: http://google-refine.googlecode.com/svn/trunk@371 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 19:11:37 +00:00
Stefano Mazzocchi
988378c761 Hmm, String.split() bites us again: use the commons-lang one instead to avoid having to escape regexp values (this was preventing a user from splitting by "." in GEL)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@370 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 17:49:31 +00:00
Stefano Mazzocchi
0e07ec7acc crude, I know, but for now make Gridworks digest RDF/XML as it was XML (works surprisingly well, btw)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@369 7d457c2a-affb-35e4-300a-418c747d4874
2010-04-01 16:56:38 +00:00
Stefano Mazzocchi
dced641599 - added the ability to specify the character separator for CSV or TSV files that don't use commas or tabs (this was needed to parse a dataset that we got from the BBC to try things out)
- used commons-lang split function instead of the java String.split one, this is necessary to avoid having to escape separators that might be confused for regexps


git-svn-id: http://google-refine.googlecode.com/svn/trunk@368 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 22:34:21 +00:00
Stefano Mazzocchi
77b452e87f adding version information to the about page
NOTE: this shows up only in the packaged distribution


git-svn-id: http://google-refine.googlecode.com/svn/trunk@367 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 19:20:32 +00:00
Stefano Mazzocchi
9db9351a0b - condensing exe and cli distributions into one for windows (shipping both the .exe and the .bat in the same package)
- renamed the distibutions based on their target OS
- more polishing here and there


git-svn-id: http://google-refine.googlecode.com/svn/trunk@365 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 02:11:53 +00:00
Stefano Mazzocchi
3c9af6501e more consistent naming and various polishing
git-svn-id: http://google-refine.googlecode.com/svn/trunk@364 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:52 +00:00
Stefano Mazzocchi
5884d257db default to 'relevance' instead of recon (which is faster). Change to recon if the user suggests schema hooks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@363 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 01:07:17 +00:00
Stefano Mazzocchi
571f2c9ab3 - better README
- made the build system obtain and use svn revision info directly in version.js
- fixed lunch4j initial memory usage
- added .ini support for .exe starting in windows
- more robust up-to-date logic that uses SVN revisions instead of dates
- connected to new freebase.com/labs/gridworks web site


git-svn-id: http://google-refine.googlecode.com/svn/trunk@362 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-31 00:17:40 +00:00
Stefano Mazzocchi
8508fbbd01 - adding the BSD license
- changed a buch of issues with cli builds and windows operations
- added the ability to persist init parameters in an gridworks.ini file (both used in windows and unix)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@359 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-30 21:24:04 +00:00
Stefano Mazzocchi
f0fec31d98 moving the gridworks site over to freebase.com/labs for now (which is in acre)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@358 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-30 21:23:00 +00:00
Stefano Mazzocchi
7c132cfa53 clean eclipse warnings
git-svn-id: http://google-refine.googlecode.com/svn/trunk@357 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-29 20:51:18 +00:00
David Huynh
1d0e6abaf8 Got some work done on the plane:
- better detection of record XML elements in XML importer
- XML importer creates column groups and data table view renders them


git-svn-id: http://google-refine.googlecode.com/svn/trunk@356 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-27 05:23:09 +00:00
David Huynh
2a9fbd7d81 Made sure columns are named hierarchically in XML importer.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@355 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:15:10 +00:00
David Huynh
df7389876f First shot at XML import.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@354 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 23:08:08 +00:00
David Huynh
4e76155652 Use application/x-unknown when exporting TSV so that the browser just saves the result. This is good for large exports, which overload the browser if the browser tries to display them.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@353 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:51:27 +00:00
David Huynh
47cad64a3f Properly unescape \t, \r, \n, \\.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@352 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 17:50:44 +00:00
David Huynh
30e3ca4965 Added splitByLengths function.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@351 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 04:30:31 +00:00
David Huynh
4df1c4107a Fixed a bug introduced recently: recon candidates were not serializing their topic types for the data view, so in the data view we can't send back a candidate's types when the user wants to match the candidate to some cells. I need to figure out a better way to optimize this.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@350 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-24 03:58:52 +00:00
David Huynh
00cce1b99a Styling tweaks.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@349 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 21:24:44 +00:00
David Huynh
c6cd48a6d2 Polished the path in the header pane.
Made dropdown menu graphics a little more subtle.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@348 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 21:12:13 +00:00
David Huynh
32e395d0e6 Updated version's date for another rolling release.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@347 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 19:43:06 +00:00
David Huynh
cbfa77dcaa Polished history widget. Now there's a link to roll the widget up.
Made sure busy dialog also has rounded corners on Chrome.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@346 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 19:30:50 +00:00
David Huynh
5c97177efd Added "reset" and "remove" links to facet panel.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@345 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 18:53:29 +00:00
David Huynh
85d1671d6e Fixed minor bug: recon wasn't saving out its candidates if its judgment is Matched. So when a project is saved and reloaded, it loses all of the recon candidates except for the matches.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@344 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 18:02:08 +00:00
David Huynh
ac57dea9c7 Do our own positioning of the process widget rather than using margin:auto, so that the links on the top header panel don't get obscured.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@343 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 04:48:05 +00:00
David Huynh
455802bffb Alert user of new version to download, if any.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@342 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 04:36:01 +00:00
David Huynh
2846d66261 Detect max cell index on load, just in case the max cell index we've stored previously was out of whack.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@341 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 03:19:17 +00:00
David Huynh
f8d30e9e8e Don't send back recon candidate types for rendering cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@340 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 01:17:45 +00:00
David Huynh
c07ba83a36 Don't send back recon candidate types for rendering cells.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@339 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 01:17:32 +00:00
David Huynh
3dc4db020f Support quick undo of the last operation (Ctrl-Z).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@338 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-23 00:26:28 +00:00
David Huynh
6d8776953d Added case sensitive and regex checkboxes to text search facets.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@337 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 23:01:53 +00:00
David Huynh
f5d270e35a Fixed "off by one bucket" bug in range facet's binning algorithm.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@336 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 22:04:29 +00:00
David Huynh
19ba207d27 Re-ordered the "other" choices in numeric range facets to make better use of space.
Changed main layout of whole application so that the horizontal scrollbar of the data table is visible without scrolling.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@335 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 21:48:36 +00:00
David Huynh
1d20b33cf1 Documented the history package.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@334 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 18:19:31 +00:00
David Huynh
ec0110d65b Documented gel.* packages.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@333 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 05:44:35 +00:00
David Huynh
60dd7eab82 Documented expr.* packages.
Converted some tabs into spaces.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@332 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-22 00:54:56 +00:00
David Huynh
60f60507f7 Fixed minor bug introduced recently into the Export Project menu command.
Documented the commands.* packages.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@331 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-21 20:26:35 +00:00
David Huynh
7648126a5e Made facets deal with java.util.Collection rather than just Object[].
Documented the browsing.* packages.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@330 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-21 07:14:39 +00:00
David Huynh
d90e75dff1 Started a round of documentation.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@329 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-20 23:56:28 +00:00
David Huynh
a43b2a72c1 Made various GEL functions and the forEach control work with java.util.List and java.util.Collection in addition to just Object[].
Added field columnNames to row object.
Added 1-bounded numeric log facet.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@328 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 23:04:17 +00:00
Vishal Talwar
6fba7d1e7f sped up jython evaluation by calling function directly instead of invoking parser on string representation of function call
git-svn-id: http://google-refine.googlecode.com/svn/trunk@327 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 21:45:01 +00:00
David Huynh
ff0049307e Increased file upload size limit to 1GB.
Fixed charset detector to be more robust in trying more than one charset.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@326 7d457c2a-affb-35e4-300a-418c747d4874
2010-03-19 19:53:11 +00:00