Tom Morris
0dc14af1aa
Fix bug in refactoring of ImportingJob from commit
...
1e5f89e84c
2013-07-26 18:50:03 -04:00
Tom Morris
46a1e198d8
Recompute max cell index when rebuiling maps in ColumnModel - fixes #406
2013-07-26 18:48:20 -04:00
Tom Morris
7edc550618
Give a reasonable error message on Excel 95 import failure - fixes #564
2013-07-26 16:24:56 -04:00
Tom Morris
dc4d04c132
Allow arrays containing null in Filter & ForEach - fixes #741
2013-07-26 15:20:44 -04:00
Tom Morris
1e5f89e84c
Centralize handling of import job config object & synchronize to allow
...
multiple accessors
2013-07-25 15:41:08 -04:00
Tom Morris
dc206e1889
Switch to ConcurrentHashMap for jobs table to allow multiple accessors
2013-07-25 15:36:54 -04:00
Tom Morris
0ff2d7ed9f
Simplify implementation from pull request #728
2013-07-25 13:45:44 -04:00
Tom Morris
6dd4b8ea23
Add tests for boolean functions and tighten up error handling
2013-07-25 13:45:04 -04:00
Tom Morris
2c2c0d3d68
Merge pull request #728 from jmcastagnetto/master
...
Implements Xor operation
2013-07-25 10:00:11 -07:00
Blakko
6e90bc41f6
Merge remote-tracking branch 'origin/master' into internationalization
...
Conflicts:
extensions/freebase/module/scripts/dialogs/schema-alignment/schema-alignment-dialog.html
main/webapp/modules/core/index.vt
main/webapp/modules/core/project.vt
main/webapp/modules/core/scripts/project/browsing-engine.js
main/webapp/modules/core/scripts/project/history-panel.html
2013-07-25 11:07:59 +02:00
Blakko
e6e6c8c002
Added a "Language Settings" menu at index
...
Now the language manually set has priority over the browser lang
Update translations
2013-07-12 11:12:33 +02:00
Tom Morris
92e4427c39
Adding a TODO
2013-07-10 15:13:22 -04:00
Tom Morris
32773122c4
Fix CollationKey creation - fixes #753
2013-07-10 15:12:49 -04:00
Blakko
552b0bf94b
Internationalization of the index part (create/open/update) of refine
2013-07-02 13:40:50 +02:00
Tom Morris
5b6bc888f7
Fix template escape processing. Fixes #752 .
2013-06-30 12:21:26 -04:00
Tom Morris
a3b4b45e4e
Support non-string types in facetCount() - fixes #591
2013-06-23 12:04:48 -04:00
Tom Morris
51c1bc4a2f
Refactor default toString with date support into separate utility
2013-06-23 12:02:13 -04:00
Tom Morris
c961bb64de
Flush all column caches on row removals/changes. Fixes issue 567.
2013-06-22 18:44:26 -04:00
Tom Morris
fd58bd3327
Move documentation to Javadoc where it's visible
2013-06-22 16:27:18 -04:00
Tom Morris
6e88d068ee
Throw a narrower exception
2013-06-22 16:26:45 -04:00
Jesus M. Castagnetto
0795bd8422
resolved .gitignore conflict
2013-06-19 12:10:32 -05:00
Jesus M. Castagnetto
b09bb4463e
fix error in index caught by thadguidry
2013-06-19 11:21:26 -05:00
Tom Morris
b91fc8a2b1
Use CollationKeys when sorting text. Fixes issue 738
2013-06-17 15:51:29 -04:00
Tom Morris
067fcacec7
Clean up to pass tests:
...
- don't include TAB in control characters which get stripped so we can
use it for splitting
- remove trailing space from normalize strings
2013-05-31 17:06:03 -04:00
Tom Morris
000c0a38a8
Compute delay from request issue, not response return. Fixes #721
2013-05-26 10:13:16 -04:00
Tom Morris
4a5d3d4662
Convert dates to ISO 8601 for reconciliation. Fixes #688 .
2013-05-26 10:08:55 -04:00
Tom Morris
7615db97cf
Add Javadoc clean up variable naming. No functional change.
2013-05-26 10:07:37 -04:00
Tom Morris
36dd95c263
Add TODO for record mode operation
2013-05-26 07:54:33 -04:00
Tom Morris
567da6aa9f
Normalize line endings
...
Add .gitattributes & do one-time normalization of line endings
2013-03-23 18:46:20 -04:00
Tom Morris
6a91b5d75b
Use InputStream instead of Reader for JSON import - fixes #698
2013-03-23 18:36:05 -04:00
Tom Morris
6b3592982e
Remove O(n^2) issue in tree importers - fixes #699
...
- Add sparse/based list implementation for ImportRecord
2013-03-23 12:02:51 -04:00
Tom Morris
f78dfadcf3
Clean up tree import utilities for #699
...
- lazy allocate objects
- conditionalize logging to prevent calls to StringBuilder & toString()
These are secondary issues, but still worth cleaning up.
2013-03-23 11:56:58 -04:00
Tom Morris
0a2ba1b1ae
Switch from LinkedList to ArrayList
...
Just a simple list. No need for extra overhead..
2013-03-23 08:16:23 -04:00
Tom Morris
bfa7c34d17
Merge pull request #659 - closes #659
2013-03-18 21:24:01 -04:00
Tom Morris
31cffa1181
Merge remote-tracking branch 'upstream/master'
2013-03-18 21:16:55 -04:00
Tom Morris
8a61cf731b
Merge pull request #664 from Arcadelia/Preserve_Quotes
...
Quotes should not be removed from values
2013-03-18 18:12:51 -07:00
Tom Morris
fe943fe3ea
Flag English specific stopwords for cleanupp
2013-03-18 20:20:46 -04:00
Tom Morris
7b9f6836e1
Update key & id recon to new Freebase APIs - part of #696
2013-03-12 16:50:23 -04:00
Tom Morris
7578d3375f
Add logger and logging
...
- fix exception printing that goes nowhere
- make logger available for subclasses to use
2013-03-11 13:14:20 -04:00
Tom Morris
a2a8f4af2e
Patch applied - closed #315
2013-03-06 21:45:54 -05:00
Tom Morris
d8d82bf8b7
Clean up a couple more format guessing issues left over from #685
2013-03-06 20:39:39 -05:00
Tom Morris
369bfffb2f
Don't guess field widths unless we have at least 3 lines
...
- Investigation of #685 showed that single line files were being guessed
as fixed field width
2013-03-04 17:47:06 -05:00
Tom Morris
6b676f7513
Handle MIME media types which have charset param - fixes #685
2013-03-04 17:45:34 -05:00
Tom Morris
10bd7e3b75
Make upper bound of time facet inclusive - fixes issue #648
2013-03-03 16:06:20 -05:00
Tom Morris
eba03fc69e
Protect joins map with mutex - fixes issue #652
2013-03-03 09:36:43 -05:00
Tom Morris
7b3379afc7
fix range check in getFields - fixes issue 687
2013-02-26 16:35:21 -05:00
Tom Morris
389e762251
Merge remote-tracking branch 'upstream/master'
2013-02-26 00:01:06 -05:00
Tom Morris
95e13eac50
Improve recon error handling
2013-02-26 00:00:03 -05:00
Tom Morris
50888c6f2e
Merge pull request #666 from Arcadelia/Temp-file_removal
...
Fixed removal of upload temp files
2013-02-11 15:11:24 -08:00
Tom Morris
1033ce973e
TODO about memory usage
2013-02-03 15:56:54 -05:00
Jesus M. Castagnetto
71f3196048
added comment on implementation
2013-02-01 23:45:43 -05:00
Jesus M. Castagnetto
36d2c4ac44
Added full text of BSD 2-clause
2013-02-01 23:44:35 -05:00
Jesus M. Castagnetto
df450b20f7
Registering new XOR command
2013-02-01 22:42:01 -05:00
Jesus Castagnetto
fec35a8bc6
Update main/src/com/google/refine/expr/functions/booleans/Xor.java
2013-02-01 21:07:42 -05:00
Jesus Castagnetto
ebec459cfd
indentation change
2013-02-01 21:00:36 -05:00
Jesus Castagnetto
473e2f367f
Implementing Xor operation
2013-02-01 17:59:16 -08:00
Tom Morris
c0347225b8
Switch escape character from NUL to DEL in hopes that it's rarer.
2013-02-01 17:12:07 -05:00
Frank Wennerdahl
2c59a0059f
Fixed removal of upload temp files
...
Fixed an issue with an unclosed stream preventing upload temp files from
being removed after use. Also removed the use of FileCleaningTracker and
instead added manual removal of all tempfiles. By doing this the reaper
threads in FileCleaningTracker are avoided and files are removed
directly after use.
2013-01-24 09:59:09 +01:00
Frank Wennerdahl
64cf62e081
Fixed history and header update in IE
...
Due to Internet Explorer caching GET requests the Undo/Redo list and
column headers were not updated, leaving essential parts of the user
interface crippled even if Google Frame is installed. Adding
Cache-Control headers to the responses fixes this.
2013-01-24 09:39:12 +01:00
Frank Wennerdahl
1f7ab046c7
Quotes should not be removed from values
...
Leading quotation marks should not be removed from values. If they have
been left by the importing parser they should be considered part of the
value.
2013-01-24 09:04:17 +01:00
Frank Wennerdahl
ebdc40ad71
Added CSV quote options
...
Added two additional CSV options, one for parsing and one for export.
Specifying strict quotes when parsing will ignore all data not quoted.
Specifying quote all when exporting will enclose all values in quotes.
No front-end changes made, just added the support for the options in the
requests.
2013-01-21 08:21:16 +01:00
Frank Wennerdahl
f837643f1e
Support for multi-char-separators in CSV
...
This change requires that the following patch is applied to OpenCSV:
http://sourceforge.net/tracker/index.php?func=detail&aid=3599477&group_id=148905&atid=773543
2013-01-18 16:28:27 +01:00
Tom Morris
33aa1132d7
Clarify wording/naming of blank rows export option - fixes issue #651
...
- clarify that it refers to all non-null cells
- rename variables without compatibility constraints to match actual
function
2013-01-14 16:36:09 -05:00
Tom Morris
0bd2104a16
Issue 630: Change branding from Google Refine to OpenRefine
...
** The first native Github commit (ie not one converted from SVN **
Change Google Refine to OpenRefine or just Refine.
Change icon filenames and add some placeholder icons
2012-10-18 19:40:31 -04:00
Tom Morris
068e0916a2
FIXED - task 587: Correct initialization of the temporary directory - patch from the Wikier project
...
http://code.google.com/p/google-refine/issues/detail?id=587
https://bitbucket.org/wikier/google-refine/changeset/f3dbdb16a320#chg-main/src/com/google/refine/RefineServlet.java
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2583 7d457c2a-affb-35e4-300a-418c747d4874
2012-10-13 15:58:44 +00:00
Tom Morris
4d48741ce0
FIXED - task 574: create safe sheet names for Excel export - patch from jd@tekii.com.ar
...
http://code.google.com/p/google-refine/issues/detail?id=574
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2582 7d457c2a-affb-35e4-300a-418c747d4874
2012-10-12 23:05:17 +00:00
Tom Morris
ca2e959957
FIXED - task 529: Add support for key/value transpose with only two columns as well as repeating key fields in a single record.
...
http://code.google.com/p/google-refine/issues/detail?id=529
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2574 7d457c2a-affb-35e4-300a-418c747d4874
2012-10-05 23:31:25 +00:00
Tom Morris
ffe674729c
Just a little Javadoc. No functional changes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2573 7d457c2a-affb-35e4-300a-418c747d4874
2012-10-05 21:10:32 +00:00
Tom Morris
2c52a00f55
Fixed - issue 544,600,618: Clean up handling of compressed files & archives with multi-segment paths
...
http://code.google.com/p/google-refine/issues/detail?id=600
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2569 7d457c2a-affb-35e4-300a-418c747d4874
2012-09-22 18:08:56 +00:00
Tom Morris
748e205ae8
FIXED - task 616: Support bzip2 decompression on import
...
http://code.google.com/p/google-refine/issues/detail?id=616
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2568 7d457c2a-affb-35e4-300a-418c747d4874
2012-09-22 16:00:42 +00:00
Tom Morris
27e3c0c8dc
FIXED - task 614: Use same instance of OAuthProvider in OAuth dance. Patch supplied by sdeo@google.com
...
http://code.google.com/p/google-refine/issues/detail?id=614
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2566 7d457c2a-affb-35e4-300a-418c747d4874
2012-09-19 23:16:29 +00:00
Tom Morris
b3f5fada95
FIXED - task 578 & 596: Clean up JSON importer
...
http://code.google.com/p/google-refine/issues/detail?id=578
http://code.google.com/p/google-refine/issues/detail?id=596
Extend tree parser framework to allow any Serializable instead of just Strings. Use this in JSON importer to: Import keywords null, true, false; Import empty strings and don't trim whitespace from strings on import; Import numbers directly instead of importing them as text and then parsing them ourselves. Add tests to verify all this stuff
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2543 7d457c2a-affb-35e4-300a-418c747d4874
2012-09-08 01:20:25 +00:00
Tom Morris
93d6e176d6
Task 478: Default "guess datatypes" to False so importers which don't specify it (e.g. gData & Excel) aren't effected
...
http://code.google.com/p/google-refine/issues/detail?id=478
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2541 7d457c2a-affb-35e4-300a-418c747d4874
2012-09-07 21:17:34 +00:00
Tom Morris
83dce305cb
FIXED - task 432: cross() failing - flush join cache table when column changes
...
http://code.google.com/p/google-refine/issues/detail?id=432
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2539 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-30 16:31:22 +00:00
Tom Morris
9b54a8f29e
FIXED - task 559: Deadlock between autosave thread and history code
...
http://code.google.com/p/google-refine/issues/detail?id=559
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2538 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-30 16:22:28 +00:00
Stefano Mazzocchi
ba89daec1c
make oauth against freebase work again in chrome
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2537 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-23 20:58:08 +00:00
Tom Morris
12a61b6ec6
task 603: range check column move commands
...
http://code.google.com/p/google-refine/issues/detail?id=603
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2534 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-18 22:01:23 +00:00
Tom Morris
202018fac4
Add Javadoc. No code changes.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2533 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-18 22:00:41 +00:00
Tom Morris
4bb6c43982
task 604: add Guava to main project so that we're not dependent on an extension
...
http://code.google.com/p/google-refine/issues/detail?id=604
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2531 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-15 13:33:17 +00:00
Tom Morris
1e043dcc94
FIXED - task 604: The common transform “Trim leading and trailing whitespace” doesn’t trim non-breaking spaces
...
http://code.google.com/p/google-refine/issues/detail?id=604
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2529 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-14 23:00:13 +00:00
Tom Morris
f29f77e8f8
STARTED - task 604: The common transform “Trim leading and trailing whitespace” doesn’t trim non-breaking spaces
...
http://code.google.com/p/google-refine/issues/detail?id=604
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2528 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-14 20:09:49 +00:00
Tom Morris
4bf212c03d
FIXED - task 154: Can't import RDF/XML Data
...
http://code.google.com/p/google-refine/issues/detail?id=154
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2526 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-05 16:31:41 +00:00
Tom Morris
5881addac8
Throw an exception if unsupported verb is used
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2525 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-05 15:36:23 +00:00
Tom Morris
b2ae74d23f
FIXED - task 586: Only one parse date format is attempted from list in toDate(format1,format2)
...
http://code.google.com/p/google-refine/issues/detail?id=586
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2520 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-03 18:01:01 +00:00
Tom Morris
4319314675
FIXED - task 594: Date diff function doesn't work for two Calendar objects
...
http://code.google.com/p/google-refine/issues/detail?id=594
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2519 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-02 21:41:19 +00:00
Tom Morris
efa58630cf
Add constructor that takes a Throwable to eliminate redundant code from callers.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2518 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-02 21:38:00 +00:00
Stefano Mazzocchi
2cb31b8b29
fixing oauth problems with redirection for the Freebase API
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2516 7d457c2a-affb-35e4-300a-418c747d4874
2012-08-01 21:46:53 +00:00
David Huynh
4cfb921082
Added getStringKey() method for when it is difficult to generate integer keys that don't collide
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2515 7d457c2a-affb-35e4-300a-418c747d4874
2012-07-19 00:25:41 +00:00
Stefano Mazzocchi
6e41f4ad91
make the latest eclipse happy (it triggers a warning)
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2513 7d457c2a-affb-35e4-300a-418c747d4874
2012-07-12 01:55:11 +00:00
Stefano Mazzocchi
bccea8cebe
we could be leaking file descriptors here
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2506 7d457c2a-affb-35e4-300a-418c747d4874
2012-06-30 07:05:08 +00:00
Stefano Mazzocchi
f84dcff900
moving oauth authorize and deauthrorize into the core module because they are reusable across extensions
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2505 7d457c2a-affb-35e4-300a-418c747d4874
2012-06-29 19:39:42 +00:00
Tom Morris
8872c1b0a1
Keep track of when we have unsaved preference changes
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2502 7d457c2a-affb-35e4-300a-418c747d4874
2012-06-02 21:06:46 +00:00
Tom Morris
a0812c5751
Be slightly more tolerant of weird spreadsheet data
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2501 7d457c2a-affb-35e4-300a-418c747d4874
2012-06-02 21:00:30 +00:00
Tom Morris
c47b1e0ab7
Mark project as modified when metadata is changed
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2491 7d457c2a-affb-35e4-300a-418c747d4874
2012-04-14 14:10:11 +00:00
Tom Morris
8d22ede1f8
Issue 554 - rank formats *before* serializing them.
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2482 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-16 20:21:57 +00:00
Tom Morris
b3f8ce83c1
Issue 553 - Make sure we have a usable filename when importing from a URL
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2481 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-16 20:16:18 +00:00
Tom Morris
51c586bc2c
Issue 543 - Handle HTTP responses with Content-Encoding of gzip
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2480 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-16 20:12:10 +00:00
Tom Morris
a8cb23ca51
Issue 544 - preserve directory path after decompressing file
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2479 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-16 20:06:54 +00:00
Tom Morris
e97e7523b2
Issue 548 - Convert non-strings to strings before escaping
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2463 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-14 03:06:11 +00:00
Tom Morris
18b780bebe
Issue 517 - Fix combin() function to a) increase upper limit and b) keep it from continually recomputing the same values in recursion
...
git-svn-id: http://google-refine.googlecode.com/svn/trunk@2459 7d457c2a-affb-35e4-300a-418c747d4874
2012-03-08 22:53:21 +00:00