History
Log In
h
ome
b
rowse project
f
ind issues
Q
uick Search:
Learn more about
Quick Search
Filter:
View
Edit
New
Manage
You are currently using a new, unsaved search.
Summary
Project:
Tika
Sorted by:
Key descending
Operations
Issue Navigator
[
Permlink
]
Displaying issues
1
to
50
of
338
matching issues.
Current View:
Browser
(
Current Fields
|
Printable
|
Full Content
)
|
XML
| RSS
(
Issues
|
Comments
)
|
Word
| Excel
(
All fields
|
Current fields
)
1
|
2
|
3
|
4
|
5
|
6
|
7
|
Next >>
T
Key
Summary
Assignee
Reporter
Pr
Status
Res
Created
Updated
Due
TIKA-338
Trying to use -encoding parameter alwyas results in an exception
Unassigned
Peter Wolanin
Closed
Invalid
27/Nov/09
27/Nov/09
TIKA-337
SWF parser
Unassigned
Julien Nioche
Open
UNRESOLVED
27/Nov/09
27/Nov/09
TIKA-336
More issues with RDF mime detection
Chris A. Mattmann
Chris A. Mattmann
Resolved
Fixed
25/Nov/09
25/Nov/09
TIKA-335
TXTParser should use incoming charset
Unassigned
Ken Krugler
Open
UNRESOLVED
25/Nov/09
25/Nov/09
TIKA-334
HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag
Unassigned
Ken Krugler
Open
UNRESOLVED
25/Nov/09
25/Nov/09
TIKA-333
Improve accuracy of charset detection for HTML pages
Unassigned
Ken Krugler
Closed
Not A Problem
25/Nov/09
25/Nov/09
TIKA-332
Use http-equiv meta tag charset info when processing HTML documents
Unassigned
Ken Krugler
Open
UNRESOLVED
25/Nov/09
25/Nov/09
TIKA-331
Windings font recognition in Tika parsing + spacing issue
Unassigned
MRIT64
Open
UNRESOLVED
24/Nov/09
24/Nov/09
TIKA-330
Better HWP (Hangul Word Processor) detection pattern
Jukka Zitting
Jukka Zitting
Resolved
Fixed
23/Nov/09
23/Nov/09
TIKA-329
secure-processing not supported by some JAXP implementations (2)
Unassigned
Julien Nioche
Open
UNRESOLVED
20/Nov/09
20/Nov/09
TIKA-328
Add parser for .flv videos
Unassigned
Sami Siren
Open
UNRESOLVED
19/Nov/09
19/Nov/09
TIKA-327
Parsing "HTML" as DcXML
Unassigned
Erik Hetzner
Open
UNRESOLVED
18/Nov/09
18/Nov/09
TIKA-326
Map javax.imageio.IIOException to TikaException
Jukka Zitting
Jukka Zitting
Resolved
Fixed
17/Nov/09
17/Nov/09
TIKA-325
tika-parent/pom.xml missing <inceptionYear>2007</inceptionYear>
Jukka Zitting
Luke Nezda
Resolved
Fixed
15/Nov/09
17/Nov/09
TIKA-324
Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X)
Jukka Zitting
Peter Wolanin
Resolved
Fixed
15/Nov/09
27/Nov/09
TIKA-323
Make Tika site look like Lucene ecosystem Apache Forrest-built sites
Chris A. Mattmann
Chris A. Mattmann
Open
UNRESOLVED
14/Nov/09
14/Nov/09
TIKA-322
Improve encoding detection speed and accuracy
Unassigned
Jukka Zitting
Open
UNRESOLVED
13/Nov/09
25/Nov/09
TIKA-321
Optimize type detection speed
Unassigned
Jukka Zitting
Open
UNRESOLVED
13/Nov/09
13/Nov/09
TIKA-320
Allow disabling language detection in AutoDetectParser
Jukka Zitting
Erik Hetzner
Resolved
Fixed
12/Nov/09
16/Nov/09
TIKA-319
HtmlParser - use encoding hint only if charset is supported
Jukka Zitting
Piotr B.
Resolved
Fixed
12/Nov/09
13/Nov/09
TIKA-318
Upgrade nekohtml dependency from 1.9.9 to 1.9.13
Jukka Zitting
Attila Király
Resolved
Invalid
07/Nov/09
13/Nov/09
TIKA-317
Annotation-based Tika configuration
Jukka Zitting
Jukka Zitting
Open
UNRESOLVED
07/Nov/09
07/Nov/09
TIKA-316
Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length)
Unassigned
Mike Hays
Open
UNRESOLVED
03/Nov/09
13/Nov/09
TIKA-315
Tika appears to skip over an entire section of a Microsoft Word Document
Unassigned
Sanjeev Rao
Open
UNRESOLVED
26/Oct/09
13/Nov/09
TIKA-314
Initial support for JPEG EXIF metadata extraction
Jukka Zitting
Maxim Valyanskiy
Resolved
Fixed
20/Oct/09
07/Nov/09
TIKA-313
patch: ODF improvements for svg:desc, presentation notes
Jukka Zitting
Bart Hanssens
Resolved
Fixed
17/Oct/09
13/Nov/09
TIKA-312
TikaCLI can't print metadata
Jukka Zitting
Maxim Valyanskiy
Resolved
Fixed
16/Oct/09
16/Oct/09
TIKA-311
Broken handling of <a name="..."/> tags
Jukka Zitting
Jukka Zitting
Resolved
Fixed
14/Oct/09
14/Oct/09
TIKA-310
Use TagSoup to parse HTML
Jukka Zitting
Jukka Zitting
Resolved
Fixed
14/Oct/09
14/Oct/09
TIKA-309
Mime type application/rdf+xml not correctly detected
Chris A. Mattmann
Yuan-Fang Li
Resolved
Fixed
13/Oct/09
25/Nov/09
TIKA-308
Improve supertype handling in type registry
Unassigned
Ken Krugler
Open
UNRESOLVED
11/Oct/09
11/Oct/09
TIKA-307
Better handling of partial/truncated input data to parsers
Unassigned
Ken Krugler
Open
UNRESOLVED
10/Oct/09
10/Oct/09
TIKA-306
patch: OOXMLParserTest uses OpenOfficeParser
Unassigned
Bart Hanssens
Resolved
Fixed
09/Oct/09
16/Oct/09
TIKA-305
XHTML href attributes end up in the wrong namespace
Jukka Zitting
Benson Margulies
Resolved
Fixed
09/Oct/09
16/Oct/09
TIKA-304
HtmlParser could be easier to subclass
Jukka Zitting
Benson Margulies
Resolved
Fixed
09/Oct/09
16/Oct/09
TIKA-303
XHTMLContentHandler mishandles headers
Jukka Zitting
Benson Margulies
Resolved
Invalid
08/Oct/09
16/Oct/09
TIKA-302
patch: initial support for ePUB
Jukka Zitting
Bart Hanssens
Resolved
Fixed
07/Oct/09
16/Oct/09
TIKA-301
patch: embedded ODF and office:annotation
Jukka Zitting
Bart Hanssens
Resolved
Fixed
05/Oct/09
16/Oct/09
TIKA-300
rename openoffice.. parser classes to odf..
Jukka Zitting
Bart Hanssens
Resolved
Fixed
05/Oct/09
16/Oct/09
TIKA-299
Update Geronimo dependency in tika-parsers pom.xml to 1.0.1
Jukka Zitting
Ken Krugler
Resolved
Fixed
30/Sep/09
30/Sep/09
TIKA-298
CompositeParser.getParser() should use mimetype hierarchy when falling back
Unassigned
Ken Krugler
Open
UNRESOLVED
30/Sep/09
07/Nov/09
TIKA-297
The HtmlParser ignores <menu> tags, resulting in invalid XHTML
Jukka Zitting
Ken Krugler
Resolved
Fixed
29/Sep/09
30/Sep/09
TIKA-296
Automatically set the supertype for "+xml" mimetypes
Jukka Zitting
Ken Krugler
Resolved
Fixed
28/Sep/09
11/Oct/09
TIKA-295
Rough cut of mbox parser
Jukka Zitting
Ken Krugler
Resolved
Fixed
28/Sep/09
14/Oct/09
TIKA-294
TikaCLI always uses System.in for input
Jukka Zitting
Ken Krugler
Resolved
Fixed
28/Sep/09
02/Oct/09
TIKA-293
XWPFWordExtractorDecorator does not extract bookmarks
Jukka Zitting
Maxim Valyanskiy
Resolved
Fixed
28/Sep/09
02/Oct/09
TIKA-292
PDFBox is too verbose
Jukka Zitting
Jukka Zitting
Resolved
Fixed
28/Sep/09
28/Sep/09
TIKA-291
Adobe InDesign support
Unassigned
Jukka Zitting
Open
UNRESOLVED
27/Sep/09
27/Sep/09
TIKA-290
org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.txt.TXTParser@6caf16
Jukka Zitting
MRIT64
Resolved
Fixed
27/Sep/09
02/Oct/09
TIKA-289
Add magic byte patterns from file(1)
Unassigned
Jukka Zitting
Open
UNRESOLVED
27/Sep/09
27/Sep/09
1
|
2
|
3
|
4
|
5
|
6
|
7
|
Next >>