David Huynh
5aec75696d
Fixed Issue 477 in google-refine: Implement or remove the line separator option.
...
Also, fixed a display bug in the fixed-width parser UI: previously, tab characters forced columns to be wider.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2364 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-06 20:13:05 +00:00
David Huynh
a35b9f53f7
Made operation "Transpose columns into rows" support the option of transposing into 2 new columns rather than just one.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2362 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-06 02:50:33 +00:00
Tom Morris
85a37d23f9
Issue 474 - implement record limit for XML and JSON importers
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2359 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-05 16:38:19 +00:00
David Huynh
b36b229ba4
Fixed Issue 465: Data text file with extension .dta within a .ZIP is not automatically extracted
...
.dta isn't recognized so there's no best format detected. But now we default to text/line-based and always select all files if no file gets selected by default.
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2358 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-04 22:33:38 +00:00
David Huynh
41a90ad71f
Fixed Issue 459: Undefined error with some CSV files (incorrectly detected as EXCEL)
...
by favoring file name-based format over mime type-based format (because the user's computer might have .csv registered as an Excel format).
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2357 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-04 21:52:12 +00:00
David Huynh
2f6b635f66
Added initial implementation of Key/value Columnize operation and command.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2356 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-04 21:00:32 +00:00
Tom Morris
a7c81880a8
Issue 475 - Support escaped custom separators
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2355 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-04 19:04:16 +00:00
Tom Morris
cacbedd352
Fix index out of bounds exception when separator is the empty string
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2354 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-04 17:31:51 +00:00
Stefano Mazzocchi
09889e75c5
allow people with freeq keys to load data bypassing oauth signature (which is poorly implemented in SignPost and consumes ridiculous amounts of memory to perform the OAuth signing).
...
NOTE: this is only a temporary measure while we figure out a better way to perform authorized loads
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2353 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-01 21:47:34 +00:00
Stefano Mazzocchi
856ef6a65a
commented out unused variables
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2352 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-01 21:47:24 +00:00
Stefano Mazzocchi
8184e16bb9
updating http client and http core to the latest released versions
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2351 7d457c2a-affb-35e4-300a-418c747d4874
2011-11-01 21:46:56 +00:00
Tom Morris
83db311b18
Issue 473 - Fix cancel button for project creation import phase
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2350 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-29 22:05:24 +00:00
Tom Morris
71492c706c
Just some TODOs
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2349 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-28 17:51:20 +00:00
Tom Morris
ad8705e299
Javadoc only
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2348 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-28 17:29:35 +00:00
Tom Morris
a870e782f5
Make sure our counts are current before attempting to use them for sorting
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2347 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-28 17:28:27 +00:00
Tom Morris
5dad4d6a0b
Handle legacy projects which have an empty slot 0 for the column model (old off-by-one bug)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2346 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-23 19:29:44 +00:00
Tom Morris
fe7bf1d1f0
Refactor to conform to method naming conventions
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2344 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-21 16:57:52 +00:00
Tom Morris
ab950689dd
Add debugging info - mostly toString() methods for types missing them
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2343 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-21 16:46:55 +00:00
Tom Morris
b2781bda3f
Javadoc only
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2342 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-21 16:30:37 +00:00
Tom Morris
9a9f4c1354
Issue 467 - provide JVM heap usage as part of the progress monitor during project creation.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2341 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-21 16:28:40 +00:00
David Huynh
ff7bbc8ec0
Export libraries.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2338 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-20 09:20:53 +00:00
David Huynh
f4b2ee3715
"Transpose columns into rows" operation now supports specifying the ending column to be the last column regardless of its name.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2337 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-19 13:42:50 +00:00
David Huynh
223074bb25
Xml importer should stop trying to skip over initial non-xml content after some number of characters.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2336 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-18 15:25:31 +00:00
Tom Morris
9710521ef8
Correct column counting so maxCellIndex represents current count rather than next column
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2335 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-14 21:00:50 +00:00
Tom Morris
5d6ab76b7c
Issue 313 - fix cell format so dates export as dates rather than numbers.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2334 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-14 20:23:59 +00:00
Tom Morris
2d5125af1e
Issue 462 - don't trim whitespace from string-valued cell contents on import
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2330 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-12 23:45:52 +00:00
Tom Morris
4966dcc61f
Basic trivial test for Excel importer. Please add to this...
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2329 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-12 23:42:29 +00:00
Tom Morris
5c95c9c1f9
New exporter - Open Document Format (ODF) spreadsheets (.ods)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2326 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 21:02:23 +00:00
Tom Morris
3bd84088da
Rename OO/ODS importer with more generic name
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2325 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 21:01:45 +00:00
Tom Morris
ee0fb9033e
Javadoc
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2324 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:57:40 +00:00
Tom Morris
ca17e1ef0a
New importer for Open Document Format (ODF) spreadsheet files (.ods)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2323 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:27:40 +00:00
Tom Morris
5bde74275b
Log JVM max heap size instead of just Refine property
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2322 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:21:28 +00:00
Tom Morris
2726f61a61
Add toString methods to help with debugging
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2321 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:19:53 +00:00
Tom Morris
5c856179cb
Add TODO for suspicious code
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2320 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:14:57 +00:00
Tom Morris
d4ec9f3279
Make sure writer is closed so we don't leak file handles
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2319 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:14:13 +00:00
Tom Morris
16421303cb
Add Javadoc
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2318 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 20:12:24 +00:00
Tom Morris
da0e68dd76
Fix typo in variable name
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2317 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-11 16:26:21 +00:00
David Huynh
55c3fdebab
Bumped up version to 2.5.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2314 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-10 21:58:42 +00:00
David Huynh
1a14d82393
For XML files, ignore not just leading whitespace but anything except <.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2313 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-10 20:51:00 +00:00
David Huynh
99830b2ea9
Half fix for Issue 459: Undefined error with some CSV files (incorrectly detected as EXCEL) - just made sure the error messages are shown properly rather than "Undefined".
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2312 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-10 17:46:19 +00:00
David Huynh
8a7b2fdd4e
Added chrome frame detection.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2305 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-09 01:04:51 +00:00
Tom Morris
ef39d18cae
Switch line endings to make it easier for Windows users
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2304 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-08 19:56:16 +00:00
Stefano Mazzocchi
b413833ca1
fixing a bunch of jslint issues
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2303 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 23:44:11 +00:00
Tom Morris
fffd24d64b
Parse parameters from multipart/form-data POSTs rather than just dropping them (needed for Windmill tests, among other things)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2302 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 23:15:55 +00:00
Stefano Mazzocchi
1f67866258
fixing a bunch of inconsistencies and potential bugs as indicated by findbugs, pmd and eclipse
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2301 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 21:23:23 +00:00
Stefano Mazzocchi
df29c0b281
whitespace (no functional changes)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2300 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 19:33:28 +00:00
Stefano Mazzocchi
af5b49eab9
whitespace (no functional changes)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2299 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 19:25:15 +00:00
Stefano Mazzocchi
c0907c79ab
more to ignore
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2298 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 19:20:13 +00:00
Tom Morris
6b97f6602f
Fix last(?) failing test
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2297 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 19:16:19 +00:00
Tom Morris
31073d7712
Refactor importer interfaces to narrow exceptions thrown and handled
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2296 7d457c2a-affb-35e4-300a-418c747d4874
2011-10-07 19:06:53 +00:00