Commit Graph

3056 Commits

Author SHA1 Message Date
David Huynh
bc412b99ea Fixed bug in triple loader transposer: properties didn't get asserted before.
Made triple loader transposer index its output variables.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@106 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 07:05:09 +00:00
David Huynh
5264c829ae A bit more careful error handling during recon.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@105 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 01:17:35 +00:00
David Huynh
28a86dfe0f Automatically guess types to reconcile a column, using Stefano's trick in his "cupid" acre app.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@104 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-19 00:28:34 +00:00
David Huynh
4b2e48614b Actual work in operations must be delayed until their changes are applied.
Column addition change must track the new cell index that it allocates when it is first applied.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@103 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 23:27:40 +00:00
David Huynh
8831703a2c Implemented "apply operations" feature.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@102 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-18 05:00:56 +00:00
David Huynh
604dd53ebd Engine configs were not deserialized properly when abstract operatons are retrieved.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@101 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:40:41 +00:00
David Huynh
b1afd12bd5 Forgot a console.log().
git-svn-id: http://google-refine.googlecode.com/svn/trunk@100 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:31:28 +00:00
David Huynh
8c41af9c12 Allow operations to be extracted in abstract forms.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@99 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 01:30:09 +00:00
David Huynh
32157ce76b Changed operations to record column names instead of cell indices.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@98 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-17 00:26:38 +00:00
David Huynh
e6a98f23bd Implemented triple loader preview.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@97 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 23:32:12 +00:00
David Huynh
aa530395d2 Use tabs in the schema alignment dialog to get more space.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@96 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:45:04 +00:00
David Huynh
5de0c36f86 Protograph preview now works.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@95 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-16 20:15:19 +00:00
David Huynh
0d11e469a6 Added a preview pane in the schema alignment dialog, but it's not hooked up yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@94 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-13 01:08:12 +00:00
David Huynh
b17f71c8df More work on schema alignment dialogs: schema data from Freebase is used to automatically fill in a lot of fields in the protograph.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@93 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-13 00:11:42 +00:00
David Huynh
8189ba74fd Schema alignment dialog now saves protograph and re-renders it properly.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@92 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 22:11:49 +00:00
David Huynh
425140261f We're starting to be able to save protographs.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@91 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 20:29:56 +00:00
David Huynh
634d666949 More work on the schema alignment node dialog.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@90 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-12 02:23:26 +00:00
David Huynh
f5942773ec Still more work on the protograph, toward being able to build and save a protograph, but it's not working yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@89 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-11 06:44:48 +00:00
David Huynh
d227db0cc6 Eliminate hash maps from recon objects--they are expensive.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@88 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 19:25:21 +00:00
David Huynh
5cd147ea3c Compute record indices and render them instead of row indices.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@87 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 18:54:53 +00:00
Stefano Mazzocchi
5aaa9394da - add notes about calling this build script directly
- forgot to copy the webapp parts when building the windows distribution


git-svn-id: http://google-refine.googlecode.com/svn/trunk@86 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 18:47:32 +00:00
Stefano Mazzocchi
91d0133a76 - build a windows distribution
- update to the latest ant to use in case one is not present locally


git-svn-id: http://google-refine.googlecode.com/svn/trunk@85 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 07:30:16 +00:00
David Huynh
36e609d649 Minor fix to build.xml.
More work on schema alignment dialog: add another dialog for configuring a protograph node.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@84 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 06:39:55 +00:00
David Huynh
242e23c085 The schema alignment dialog is starting to work. The protograph gets rendered and is interactive. No saving yet.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@83 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-10 01:15:25 +00:00
David Huynh
39899b9eeb Added topic block previews for recon candidates.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@82 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:36:47 +00:00
Stefano Mazzocchi
7510594d0d build MacOSX desktop app and disk installer
git-svn-id: http://google-refine.googlecode.com/svn/trunk@81 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:21:10 +00:00
David Huynh
97e2e0eddc Implemented "judge one cell" command for making recon judgment per cell.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@80 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 20:04:43 +00:00
David Huynh
66457f0515 Factored out column header UI and cell UI to prepare for single cell commands.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@79 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 19:23:11 +00:00
David Huynh
c73ec864db A bit more prep work on rendering protograph nodes.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@78 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 06:46:40 +00:00
David Huynh
f8a1daba62 Handle formula cells in Excel files.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@77 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 01:13:11 +00:00
David Huynh
8802323839 Some more code for the schema alignment dialog box, but nothing is working yet, and I still need to think about how this should work.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@76 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 00:45:54 +00:00
David Huynh
402858a784 Process widget was causing the UI to get rendered twice at start-up.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@75 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-09 00:26:57 +00:00
David Huynh
b7cf18b86a Save a change right after it gets applied rather than when it gets created. This is because when a change gets applied, it might grab onto the old data in order to able to revert later, and we need to save that old data together with the change.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@74 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:57:37 +00:00
David Huynh
cd376c7532 Added support for Excel 2007 XML file format.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@73 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 23:44:33 +00:00
David Huynh
736c6ec1de Added main menu bar.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@72 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 21:41:49 +00:00
Stefano Mazzocchi
5697dbf491 bye bye maven
git-svn-id: http://google-refine.googlecode.com/svn/trunk@71 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 20:50:30 +00:00
Stefano Mazzocchi
2b985bf45a moving json support in its own jar (code was taken today directly from json.org and compiled and packaged by me)
git-svn-id: http://google-refine.googlecode.com/svn/trunk@70 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 20:46:27 +00:00
Stefano Mazzocchi
1f5b27653e POI deprecated the use of short, good thing
git-svn-id: http://google-refine.googlecode.com/svn/trunk@69 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 19:54:09 +00:00
Stefano Mazzocchi
6d4425c696 - finally java6 has support for wildcards... seriously, how fricking long did it take for this to happen?!
- so require java6 since it's also a lot faster and a lot more reliable for us for profiling


git-svn-id: http://google-refine.googlecode.com/svn/trunk@68 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 19:48:28 +00:00
David Huynh
bf938d4a9c Minor fixes to get gridworks.bat working on Windows.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@67 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-08 19:28:13 +00:00
Stefano Mazzocchi
f198cba994 I guess wildcards are not supported in java6 on macs
git-svn-id: http://google-refine.googlecode.com/svn/trunk@66 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 23:22:34 +00:00
Stefano Mazzocchi
1343162a75 major rewrite of the foundation:
- de-maveniziation (uses the same code that Acre uses to drive jetty directly)
 - removed all dependencies on external javascript code (jquery and suggest) by making a local copy (this makes gridworks totally self-serving, meaning that you can use it even if you don't have any internet connectivity)
 - fixed a NPE when the servlet is shutdown before any project is loaded
 - found a way to spawn a browser directly from the java code (untested in windows)
 - added two ant tasks to generate windows and macosx stand-alone binaries (unused just yet)

To run, just type "./gridworks run" at the command line


git-svn-id: http://google-refine.googlecode.com/svn/trunk@65 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 23:15:50 +00:00
David Huynh
8f186a5f10 Added a help panel to the expression preview dialog. It gets populated by function and control names for now; more info will come later.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@64 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 09:02:22 +00:00
David Huynh
d3f97fea93 While importing data, use null for cells with empty text.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@63 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 07:16:39 +00:00
Stefano Mazzocchi
a61f35079a make eclipse happier by removing @Override annotations when really it's an interface method implementation
(no functional changes)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@62 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:47:52 +00:00
David Huynh
a025b272bd String.isEmpty() is no longer there (?!).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@61 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 06:16:46 +00:00
Stefano Mazzocchi
c194beb11a hmm, shell files are picky
git-svn-id: http://google-refine.googlecode.com/svn/trunk@60 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 05:32:07 +00:00
Stefano Mazzocchi
2077d3f094 adding unix and windows startup scripts
use maven to build the eclipse scripts instead of committing them in svn which makes them less portable
(do './gridworks eclipse' at the beginning to regenerate your eclipse project files, then reload in eclipse)


git-svn-id: http://google-refine.googlecode.com/svn/trunk@59 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 05:25:44 +00:00
David Huynh
2ceaa53390 Split css files.
Allow expressions to be evaluated on null cells too.
Various minor UI polishing touches and minor bug fixes.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@58 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 03:36:40 +00:00
David Huynh
51de415809 Added recon config to column model so we can use that to do automatic schema alignment.
Fixed bug in recon operation that also tried to reconcile blank cells.

git-svn-id: http://google-refine.googlecode.com/svn/trunk@57 7d457c2a-affb-35e4-300a-418c747d4874
2010-02-07 01:07:45 +00:00