forked from jaeksoft/opensearchserver
-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathCHANGELOG.txt
748 lines (640 loc) · 30.1 KB
/
CHANGELOG.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
OpenSearchServer
http://www.opensearchserver.com
-----------------------
CHANGE LOG
All new features and bugs are described here:
https://github.com/jaeksoft/opensearchserver/issues (GH)
http://sourceforge.net/p/opensearchserve/feature-request/ (SF)
http://sourceforge.net/p/opensearchserve/bug-report/ (SF)
OpenSearchServer 1.6
Enhancement:
- GH-1900: Allow leading wildcard in Pattern query
- GH-1881: Support of array in database crawler
- GH-1879: Parsing of BLOB in the Database crawler
- GH-1870: Crawl cache should use compression
- GH-1869: Replace CXF by Jersey
- GH-1722: ERROR: root - The build is already running
Bug fixes:
- GH-1877: ClosedChannelException on Terms extraction
- GH-1857: Update htmlunit
- GH-1843: sitemap.xml fetch frequency
- GH-1840: Merging index in Scheduler not exist anymore
- GH-1641: Faceted fields returned in different orders
OpenSearchServer 1.5.14
New features:
- GH-1796: Silent backup replication
- GH-1791: Disabling the Scheduler, the WebCrawler and the FileCrawler
- GH-1780: Create a Scheduler using Rest API
- GH-1773: Crawler - API to delete URL browser
- GH-1768: Limit the crawl depth in the web crawler
- GH-1756: Crawler Status for All Crawlers trough API Call
- GH-1755: Add and Remove sitemaps from index trough the REST API / php client
- GH-1749: Web service method for web properties
- GH-1746: Add HTTP connection time out
- GH-1706: Autocompletion thread should be interruptible
- GH-1700: Tomcat version update 7.0.68
- GH-1689: Disabling link detection
- GH-1676: Support of PhantomJS in the HTMLParser
- GH-1671: Add a customizable char tokenizer
- GH-1669: An XML/JSON API for field terms extractions
Bug fixes:
- GH-1745: Can't access "Privileges" tab in last snapshots
- GH-1729: org.apache.cxf.interceptor.Fault, Tomcat 7.0.53, when starting crawl
- GH-1697: Ignored Number of suggestion for auto-completion
- GH-1697: Scheduler URL database error
- GH-1674: Faulty cookie support with HttpClient
- GH-1659: Crawler deadlocks after running while serving requests
OpenSearchServer 1.5.13
New features:
- GH-1653: REST Crawler - HTTP Header authentication method
- GH-1652: REST Crawler - an integrated sequence for paginated APIs support
- GH-1648: File event scheduler task
- GH-1635: For SMB crawling and AD login SID should also be extracted
- GH-1624: Add the field host in the field mapping of the file crawler
- GH-1617: The REST crawler should support local files indexation
- GH-1611: Support of multiple indexes
- GH-1601: Use compressed bit set for less memory consumption
- GH-1598: Facets can be limited in number
- GH-1597: Facets can be sorted by term or count
- GH-1595: Parallel sorting to better use multi-core system
- GH-1582: Programmatically retrieving hosts from Web index
- GH-1562: Fault tolerancy in the database crawler
- GH-1559: Handle short date format in sitemap (yyyy-mm-dd)
- GH-1541: Ability to disable obeying robots.txt using the API
- GH-1537: Collapsing size to 0 merges the returned field
- GH-1524: Add fetch size parameter in the database crawler
- GH-1502: A scheduler task able to launch a crawl script
- GH-1480: Multithreaded OCR
- GH-1476: New index template for credentials storing
- GH-1475: Add an encryption filter
- GH-1471: Authentication based on external index
- GH-1470: Renderer: Add range to date facet
Bug fixes:
- GH-1592: The pattern inclusion API (v2) failed to inject the URLs in the URL database
- GH-1509: Hunspell native implementation is slow
- GH-1494: Error HttpHostConnectException should not abort the crawl session
OpenSearchServer 1.5.12
New features:
- GH-1453: The log should be less verbose by default
- GH-1452: writeLockTimeout should be configurable
- GH-1449: Renderer : display fields from a join
- GH-1441: API : delete an analyzer
- GH-1323: REST API - More information about Indexes
- GH-1149: "java.io.IOException: Encrypted PDF" and others files-related error: include filename
Bug fixes:
- GH-1466: Multivalued facets are broken when using fields without term vectors
- GH-1458: The "fetch first" policy is no more applied
- GH-1455: Outer join is broken
- GH-639: File crawler: Crawls only 20% of the files
OpenSearchServer 1.5.11
New features:
- GH-1444: Add attribute option to script function XXX_IndexField
- GH-1443: New function AddNow in script module
- GH-1439: Multithreaded facets
- GH-1424: Change priority order with the charset detection in the HTML parser
- GH-1416: Add a test button in the renderer to check the authentication
- GH-1413: Parser for EML Mime Mail messages
- GH-1412: Let the parsing API decides which parser to use
- GH-1404: Define a per query behavior for batch of queries
- GH-1394: The collapsed documents should be fully visible when the size parameter is set to 0
- GH-1390: Search fields should grouped as boolean queries
- GH-1388: Analyzer: create a filter for applying various transformation to proper nouns
-Bug fixes:
- GH-1437: Broken image in screenshot capture
- GH-1434: Cannot put a negative value in the geo parameters user interface
- GH-1430: The file crawler recursive parsing misses some files
- GH-1427: The LDAP and JCIFS records do not match when querying an Active Directory
- GH-1414: Problem with special characters encoding when editing synonyms or stopwords on Windows
- GH-1406: Scoring, boosting query: not working?
- GH-1402: Empty query is not well evaluated in Search Field
- GH-1400: In search requests, the "Empty return all" attribute is not persistent
- GH-1397: ResultSet closed error when using SQL update with the database crawler
OpenSearchServer 1.5.10
New features:
- GH-1365: Search field should support the field:term syntax in pattern mode
- GH-1363: Subqueries (filters) should be processed in parallels using threads
- GH-1361: Faster cache implementation
- GH-1336: URL fragments filtering
- GH-1332: REST API - More information about Indexes
- GH-1255: Query: send batch
- GH-1253: Term filter: it should be possible to use operator OR
- GH-1213: Renderer: ability to choose to reset facets when a new search is made
- GH-1290: Scheduler: add a task for searching, transforming and sending content
- GH-1162: Search API: be able to give parameter for Boosting queries
- GH-1093: Replication: add an API to at least list existing replications
- GH-963: Scheduler: change link to documentation for CRON expression
- GH-923: Be able to pass variables to a database crawler via a scheduler job
- GH-598: Support of MongoDB indexing
- GH-298: Web crawler: crawl URL "new" in parallel to the "old" ones
Bug fixes:
- GH-1376: The monitoring upload task uses an erroneous POST format
- GH-1348: Some scheduler tasks cannot be stopped
- GH-1126: Database crawler: deletion: error No field has been given
OpenSearchServer 1.5.9
New features:
- GH-1298: Crawl remote file system using the database crawler
- GH-1269: Prepare interfaces for native bit set
- GH-1251: With large host list, the Web crawler should select the host randomly
- GH-1249: Optimize file parsing during database crawl
- GH-1233: Add time out to the autocompletion task
- GH-1225: Hunspell suggest performances
- GH-1211: JDBC cursor size (setFetchSize) should be set with the buffer size
- GH-1207: Take a different query string for each field
- GH-1188: Execute a Groovy script when a crawl thread starts
- GH-1183: Add a Czech stemmer
- GH-1179: a REST API to disable or enable the pattern lists
- GH-1152: Parser JSON API
- GH-1139: The "Manual Crawl "API should return the index document
- GH-1135: Spell check filter
- GH-1122: A keyword analyser with lower case and diacritic removal
- GH-1114: Control of crawl cache in manual crawl
- GH 1105: Add mailbox crawl scheduler task
- GH-1054: Support of Content-Disposition filename parameter
- GH-716: Ability to enable crawl cache by index instead of for whole instance
- GH-266: Multiple proxies
Bug fixes:
- GH-1316: Removal of search, filter, termvector and field cache
- GH-1305: 30 minutes time out on file crawler when no new files are fetched causes trouble
- GH-1243: RPM installation failed
- GH-1241: Web crawler: inclusion and exclusion list not followed when reading sitemap
- GH-1229: The PUT method of the Analyzer API generates duplicates
- GH-1202: Exception with RegularExpressionReplaceFilter when the replacement string is empty
- GH-1150: The MirrorAndFilter is not supported in the JSON API
- GH-1118: Query REST API, Geo parameters don't support overriding
OpenSearchServer 1.5.8
New features:
- GH-1077: Imap and Pop3 mailbox crawler
- GH-1064: Renderer authentication with HTTP headers
- GH-1060: Authentication should also run with rights stored in an external index
- GH-1053: File and Web template harmonisation
- GH-1035: Add "open folder" link in the tools widget of the renderer
- GH-1016: Renderer: ability to choose several fields
- GH-1005: A REST API for CRUD operations on the analyzers
- GH-884: Bootstrap templates in the renderer
OpenSearchServer 1.5.7
New features:
- GH-1017: The index buffer of the web crawler is fixed to 1000 documents enhancement
- GH-990: Keyword highlight should also work on fields without term vectors
- GH-979: New Delete Sync scheduler task
- GH-967: Default credentials for the database crawler
- GH-955: Reloading an index when a configuration file is changed
- GH-934: File Crawler, add Time and Parser information
- GH-920: Mirror AND Filter
- GH-918: Add sort.clear parameter to API v1
- GH-908: Allow aliases for usernames
- GH-901: Allow selecting multiple facets in the renderer
- GH-899: Datetime widget for the renderer
- GH-896: Facet merging filter in the renderer
- GH-894: Date filter widget for the renderer
- GH-887: Incremental replication should not require twice the size of the index enhancement
- GH-885: New URLWrap widget for the renderer
- GH-873: Support of Active Directory SSO (NTLM)
- GH-869: Be able to set a fail over parser
- GH-349: Be able to clone a database crawler (to not start from scratch)
Bug fixes:
- GH-891: Bad viewer URL when using proxy
OpenSearchServer 1.5.6
Bug fixes:
- GH-840: Infinite loop while crawling HTML files
- GH-836: No error in log when the database crawler fails
- GH-832: NPE at Client.java:312
- GH-826: Bug with SuggestionAnalyzer and query using a Term&Phrase mode
- GH-820: PhoneticFilter: first query returns empty value
OpenSearchServer 1.5.5
New features:
- GH-803: Multivalued fields are not visible in the renderer
- GH-775: Control the timeout parameter on replication
- GH-719: Web crawler, exclusion patterns: add a button "delete all"
Bug fixes:
- GH-774: PDF Viewer failure with SMB urls
- GH-761: Scheduler clone does not clone the included tasks
- GH-751: Web frontend: search for quoted values breaks html
- GH-737: Web crawler: cookies are not working well
OpenSearchServer 1.5.4
New features:
- GH-710: Delete document returned by a search request
- GH-704: Support of multi-words synonyms
- GH-681: Add a timeout parameter to the "WebCrawler - stop" scheduler task
- GH-653: Improve locking isolation in config.
- GH-647: Improve performance under heavy concurrent load
- GH-646: Be able to delete a corrupted index
- GH-632: JCIFS NTLM implementation in Web Crawler
- GH-626: Export archive of selected content from the crawlcache
- GH-589: Web crawler and proxy: provide fields for username and password
Bug Fixes:
- GH-739: Web crawler: language detection does not work with <html lang='xx'>
- GH-690: Searching with GeoFilter and MILES still uses KILOMETERS to search wihtin a radius
- GH-674: Better support of malformed URL
- GH-642: Updating an index during a replication may break the replicated index
- GH-639: File crawler: Crawls only 20% of the files
- GH-605: The Web crawler sometimes freezes when the HTML parser ignores canonical URLs
- GH-597: SMB crawling - Illegal character in path
- GH-596: Error popup when clicking on an index in admin GUI
- GH-523: lastModifiedDate not recorded in 1.5.2
OpenSearchServer 1.5.3
New features:
- GH-561: Add Tomcat server logs
- GH-551: Non-existent data directory throws unhelpful error
- GH-508: Add sort option in the renderer
- GH-14: Crawler: pattern URL: allow use of wildcard in hostnames
OpenSearchServer 1.5.2
New features:
- GH-525: Add Logout button to renderer
- GH-477: Support of radius in geo filter
- GH-451: Regular expression filter with replacement
- GH-409: XPath Filter
- GH-403: Sorting should support empty values
- GH-384: Analyzers should provide separated tokenizers for queries and indexations
- GH-359: Add logon window when using the NTLM authentication in the renderer
- GH-358: The SMB URLs should be translatable as FILE:///
- GH-350: Replication should be able to only synchronize the schema
- GH-339: Support of AND, OR NOT operator in search field requests
- GH-334: The snippets should support phrase queries
- GH-315: Support of PDFCrack in PDFParser
- GH-296: Returns distance when using GeoFilter
- GH-292: Be able to abort database and scheduler process
- GH-271: Query of type "Search (field)" should allow search with phrase query only
Bug Fixes:
- GH-523: Web crawler buffer not flushed on large set of URLs
- GH-473: Auto-completion build freezes the web interface
- GH-345: Issue when changing the name of a replication item
- GH-305: Web crawl returns "Error - Not Allowed" when using a proxy
- GH-299: Error reading 'driverClassList
OpenSearchServer 1.5.1
New features:
- GH-289: External process for Parsers
- GH-279: New Term filter and Phrase filter
- GH-265: New XmlXPathParser
Bug Fixes:
- GH-293: Issue with Geolocation filter with negative coordinates
- GH-264: URL browser: "delete selected URL" action deletes every URL
- GH-262: Missing libraries
- GH-261: Crawler/parser transform letter with accent into html entities
OpenSearchServer 1.5
New features:
- GH-202: Named Entity Recognition module
- GH-183: Script REST API
- GH-2: Copy Of: ability to copy from several fields
- GH-1: Improve spell check query: return existing words
- SF-288: Copy content of a field to another one in the schema
- SF-287: Indexation using text file
- SF-286: Automatically create a unique reference for each document
- SF-285: Hunspell integration
- SF-283: Gzip file support in sitemap
- SF-277: Separate query for snippets
- SF-272: Automatically set cookies in the web crawler
- SF-271: Disable nofollow directive
- SF-270: Database based script
- SF-269: Upgrade ZK library (6.5.2)
Bug fixes:
- GH-171: Highlighted attribute is not set in Search API (XML/HTTP)
- SF-190: Cancel "Edit job" page in scheduler when changing selected index
- SF-189: Do not display error of previous crawl process in "current status"
OpenSearchserver 1.4
New features:
- SF-266: Merge multiple indices
- SF-265: Retaining Search - SF-Switch Between Renders
- SF-260: Add Visio parser and Publisher parser
- SF-257: Upgrade Apache CXF to 2.7.x
- SF-251: A report module
- SF-250: Server time in scheduler interface
- SF-249: Extraction of user and group rights when crawling CIFS/SMB and FTP
- SF-248: Upgrade ZK CE to 6.5
- SF-247: Support of additional characters in LetterOrDigitTokenizer
- SF-246: Named entity recognition based on a search request
- SF-245: Highlighting in PDF documents and images (OCR)
- SF-243: RSS Parser
Bug fixes:
- SF-187: ICEPdf parser puts brackets to the indexed words
- SF-186: ICEPdf parser: only the first page is indexed
- SF-185: A more efficient Classifier loader
- SF-184: NullPointerException on SnippetField.copySnippetFields
- SF-183: maxClauseCount is set to 1024 when deleting large collection
- SF-178: Error reading 'fieldOrSnippetList' on type RendererController
- SF-170: Wrong file path in file crawler
- SF-169: java.lang.NullPointerException on flush expired contend from cache
- SF-168: Screenshot function not working
- SF-167: java.lang.NullPointerException on enabling crawler cache
- SF-166: Query -> More like This causes Property 'name' not found on type java.lang.String
- SF-165: Not possible to copy an address in Crawler -> Web -> Pattern List
- SF-163: Weight is ignored in query scoring item
- SF-161: ParseException while crawling files
OpenSearchServer 1.3.1
New features:
- SF-244: Exportable autocompletion index
- SF-242: Be able to run only one crawl session
- SF-240: Several HTML parser configuration per index
Bug fixes:
- SF-158: File crawler infinite loop while crawling FTP directory
- SF-157: NullPointerException while deleting documents from web crawler
- SF-156: Cannot forcefully unlock a NativeFSLock error
- SF-155: Trigger name cannot be null error when clicking optimize
- SF-153: Failed replication from Windows to Linux because of file separator
OpenSearchServer 1.3
New features:
- SF-238: Job called when crawl session ends
- SF-237: Transaction isolation level should editable
- SF-236: Default value in the Classifier
- SF-235: Named entity recognition
- SF-234: Free text submission to MoreLikeThis query
- SF-233: Add deletion support in XML indexation
- SF-232: FTP feed XML task
- SF-231: Autocompletion API
- SF-230: Dropbox indexation
- SF-228: Group all token filter
- SF-227: Squared Geographical Filter
- SF-226: Json Path filter
- SF-225: Degrees to radians converter
- SF-224: A delete all feature
- SF-223: A flush crawl cache scheduler task
- SF-222: Disable Meta NoIndex on HTML parser
- SF-221: Dailymotion and Vimeo API support
- SF-220: Youtube API support
- SF-218: URL and FILE database scheduler task
- SF-217: Tag boost in HTML parser
- SF-216: A number format filter
- SF-214: JBIG2 format support for PDF text extraction
- SF-210: Classifier feature
- SF-209: SOAP WebService
- SF-206: Image Parser with OCR
- SF-204: XSL support on XML upload
- SF-3510432: Support of Arabic language
- SF-202: OCR Integration
- SF-197: Adding regular expression capture in parser
- SF-187: Jsoup HTML parser
- SF-179: Consecutive duplicate characters filter
- SF-177: Phonetic filters
- SF-176: A filter to extract the domain information
- SF-175: Multiple OpenSearchServer instances on one Tomcat server
- SF-174: Adding NumDeletedDocs in Runtime/Index
- SF-166: Upgrade to ZK 5.0.10
- SF-162: Upgrade Tomcat to version 6.0.35
- SF-3421240: JSON response for OpenSearchServer
- SF-151: OpenSearch Standards
OpenSearchServer 1.2.4
New features:
- SF-3527810: Removing fragment in URL
- SF-3518687: URL normalization required in the Web crawler
- SF-3510598: Add CSV format in the select XML/HTTP API
- SF-3498421: Set a default URL value for replication
- SF-3498373: Remove script and object tag content in db crawler
- SF-3495004: Add progress info in scheduler task
- SF-3490031: Asynchronous XML upload
- SF-3489935: Default stop words list
- SF-3489934: A default autocomplete field and analyzer
- SF-3489873: A scheduler task to copy terms to an index
- SF-3489642: Unable to scroll all the log in runtime/logs panel
- SF-3487772: Advanced spell check
- SF-3487219: RegExp filter for DatabaseCrawler
- SF-3483920: BoostQuery support
- SF-3483681: Add a download button for log files
- SF-3482284: LimitInputStream should be buffered
- SF-3477836: A separated error log for the parser
- SF-3469777: Scored snippets
- SF-3466445: Upgrade Apache POI to 3.8
Bug fixes:
- SF-3531057: crawling always fails when using proxy
- SF-3531279: Epected='obj' actual='o' error while parsing the PDF
- SF-3527244: URL filter does not work for jsessionid
- SF-3521414: File not parsed in database crawler
- SF-3516114: NPE with Web Crawler when no unique field is set
- SF-3509333: Error 500 when loading robots.txt
- SF-3500288: Wrong facet count when using optimized collapsing
- SF-3498607: NPE when collapsing is enable with no documents found
- SF-3495681: StringIndexOutOfBoundsException on snippet
- SF-3489854: A faster indexation when using XML upload
- SF-3489550: Schema API gives IllegalStateException
- SF-3485241: Pages with meta noindex are indexed
- SF-3483798: HTML parser prints many logs
- SF-3483259: Underscore added when using synonyms on expression
- SF-3482151: Monitor API response gives error
- SF-3479649: SetField and DeleteField API don't update config.xml
- SF-3479548: Report log files are truncated
- SF-3476578: UTF-8 character as separator in snippet field
- SF-3475446: head indexed instead of body (using <script /> in head)
- SF-3476551: Query with Quotation Marks has unusual results
- SF-3474877: URL Filter doesn't work for jsessionid
- SF-3472905: Error while crawling local HTML file
- SF-3471749: Out Of Memory Error while indexing PDF document
OpenSearchServer 1.2.3
New features:
- SF-3431741: Fault tolerant parser on XML upload
- SF-3429967: Scheduler task: monitoring upload
- SF-3419014: Delete by query task in scheduler
- SF-3396935: Log files viewable in the user interface
- SF-3393751: XML load sheduler task
- SF-3393063: Upgrade to SeleniumHQ 2.0.4
- SF-3393052: Upgrade to NekoHTML 1.9.15
- SF-3393047: Upgrade to TagSoup 1.2.1
- SF-3393046: Upgrade to PDFBox 1.6.x
- SF-3393032: Frameset support
- SF-3392359: Drupal module for OpenSearchServer
- SF-3380911: Scheduler informations in home view
- SF-3370376: Parser have editable parameters
- SF-3366055: URLBrowser API
- SF-3353874: OSS could be a great sitemap generator
- SF-3356222: Export the search result into csv file
- SF-3344564: Adding Snippet Field Size and Tag from the API
- SF-3310490: File Crawler API
- SF-3310489: Web Crawler API
- SF-2958005: Upgrade Apache HttpClient library to 4.1.1
Bug fixes:
- SF-3429305: NullPointerException while index Word document
- SF-3426912: Authorization error when loading binary with XML Load
- SF-3419013: Scheduler job end date error
- SF-3325102: Bug Report Disabled Robots doesn't work
- SF-3409814: Error: The document is really a RTF file
- SF-3408193: Password of the XMLLoad task is not encrypted
- SF-3406657: File crawler error: unknown protocol: smb
- SF-3406378: SQLite supports only TRANSACTION_SERIALIZABLE
- SF-3396778: Word 95 or older format are not indexed
- SF-3386593: LimitException when reading Robots.txt
- SF-3380949: NPE in Renderer
- SF-3376106: The mapping of inlink and outlink is broken
- SF-3346206: NPE when Fragmenter is set through API.
- SF-3323250: Invalid XML when the query includes an ampersand
- SF-3316826: Optimize API don't return any feedback
- SF-3316820: Action API returns a blank page
- SF-3310825: WriteAbortedException in catalina.out when Tomcat restarts
- SF-3310716: Index deletion and re-creation issue
- SF-3305045: Malformed base tag causes crawling error
OpenSearchServer 1.2.2
New features:
- SF-3307328: Update notification
- SF-3300872: Scheduler task list extensible
- SF-3287318: Upgrade to seleniumhq 2.x
- SF-3266524: Return terms from non stored fields
- SF-3266520: Cache management in Web UI
- SF-3253614: Upgrade to PDFBox 1.5.x
- SF-2973374: Upgrade to ZKoss 5.0.x
- SF-3213190: Negative filtering
- SF-3175074: Screenshot of a webpage
- SF-3094822: SiteMap functionnality
Bug fixes:
- SF-3307873: Runtime/System panel not visible
- SF-3290939: NPE when no default field is selected
- SF-3278815: The language is not detected in PDF files
- SF-3192429: Not the correct definition of the Russian language
OpenSearchServer 1.2.1
New features:
- SF-3176150: time/date stamp
- SF-3186042: Disable not being allowed (robots.txt)
- SF-3182953: Crawl URL from a database
- SF-3182950: Pattern and exclusion list deactivable
- SF-3182097: Adding field boost support
- SF-3175585: More like this feature
- SF-3169421: Japanese and Korean support
- SF-3159477: Identify identical web pages
Bug fixes:
- SF-3178432: Wrong cron values in the scheduler
OpenSearchServer 1.2
New features:
- SF-3151757: Upgrade to PDFBox 1.4.x
- SF-3141193: FTP/FTPS support in the file crawler
- SF-3141192: SMB/CIFS support in file crawler
- SF-3034238: Crawler able to log in protected web site
- SF-3011773: Add Quartz as scheduler service
- SF-3138603: Upgrade Tomcat to version 6.0.xx
- SF-3103055: Convert HTML entities
- SF-3087916: Upgrade to PDFBox 1.2
- SF-3043692: Torrent Parser
- SF-3042488: Audio parser
- SF-2882260: Add a parser for text/plain
- SF-3010010: RTF parser
- SF-3038733: Add a shingle filter
- SF-3036262: Log management
- SF-3031800: Schema interface
- SF-3031204: Adding NGram support
- SF-3008440: Index replication
- SF-3026212: API and interface for document deletion
- SF-3023327: Sub domain extraction in Web Crawler
- SF-2820289: Database crawler
- SF-3019035: Neglight dynamic url while crawling
- SF-3017277: Allow wildcard query in the URL browser
- SF-3016491: Url Exporter
- SF-3016566: Monitoring API
- SF-3015939: Cluster collapsing
- SF-2830490: Size of the index
- SF-3011847: Score explanation
- SF-3008633: Possibility to turn off the highlighting
- SF-2997836: In the returned fields having extra fields from Meta tags
- SF-2997826: Possibility to index only the specified content
- SF-2991252: Possibility to index binary file and to add it to a document
- SF-2982545: Extracting term frequency informations
- SF-2881385: API to retrieve the available indice
- SF-2887376: Enhancement for the index page - SF-dropping indexes
- SF-2881388: API to list/create/modify fields in a specified schema
- SF-2973374: Upgrade to ZKoss 5.0.5
- SF-2970747: Upgrade Tomcat to version 6.0.26
- SF-2966139: Statistics lost when OSS restart
- SF-2964704: Upgrade to Lucene 2.9.x
- SF-2958015: Add source archive
- SF-2958005: Upgrade Apache HttpClient library to 4.0.1
- SF-2956498: Provide a way to send statistics report by email
- SF-2953803: Upgrade to PDFBox 1.0
- SF-2953802: Upgrade to POI 3.6
- SF-2953575: Charset detection should look at meta http-equiv
- SF-2953524: Specify default charset for parser
- SF-2929332: Faceting post collapsing
- SF-2900462: Upgrade POI to 3.5 for xlsx and docs support
- SF-2900449: Upgrade PDFBox to 0.8
Bug fixes:
- SF-3104065: File crawler crashes with java.io.EOFException
- SF-3090248: Statistics configuration lost when adding fields
- SF-3051308: <base href= ""/> is not interpreted
- SF-2881689: Requests.xml fails to rotate on some Windows platforms
- SF-3019491: NullPointerException > at java.util.regex.Matcher
- SF-3017481: The web crawler selects the host in alphabetical order
- SF-3015838: Web crawler problem with UTF-8 BOM encoding
- SF-2993103: NoClassDefFoundError BouncyCastleProvider
- SF-2990960: keyword are not highlighted in snippets
- SF-2982541: Phrase synonyms generate unwanted words
- SF-2934214: Shifted highlighting on snippet
OpenSearchServer 1.1.4
New features:
- SF-2995116: Replace Javamail library
OpenSearchServer 1.1.3
Bug fixes
- SF-2989836: Statistics are not set up in template
- SF-2988425: Crawler is not obeying disallow directives from robots.txt
- SF-2988433: Crawler ignores query part of URL when evaluating robots.txt
Changes:
- SF-2989831: Update the documentation link (wiki)
OpenSearchServer 1.1.2
New features:
- SF-2910029: Lucene read only support
Bug fixes
- SF-2885897: PHP API fail on some query with wrong xml chars
- SF-2820286: Issue with statistics aggregation
- SF-2911906: Sort functionnality seem to ignore sort order
- SF-2908345: Performance issue with large field cache
- SF-2905076: Issue with performance of web crawler
- SF-2903935: Negative value on web crawler statistics
- SF-2898343: Behaviour problem of wildcard function
- SF-2899784: duplication of returned field in returned xml
OpenSearchServer 1.1.1
Bug fixes
- SF-2886651: Default operator is not preserved when saving query
- SF-2886645: Issue with lemmatisation using the web crawler
- SF-2884347: Field seem to be lost if renamed
- SF-2884344: Renaming a field then deleting: not removed from the bo
- SF-2884275: Duplicated documents when using empty index template
OpenSearchServer 1.1
New features
- SF-2882224: Stress test for insertion
- SF-2881642: Set maximum number of segments for index optimization
- SF-2865088: Add default parser for web and file crawler
- SF-2864468: Indexation plugin within web crawler
- SF-2855049: Add a description for parsers
- SF-2854421: Added Romanian, Turkish and Danish support
- SF-2848588: Robots.txt view on web crawler
- SF-2847820: Add Russian and Chinese language support
- SF-2846159: Replace textfield by a menu for langage selection
- SF-2843516: Sort field list by alphabetic order
- SF-2843182: Use the library apache commons.io
- SF-2836913: Return the rows parameter in the search API result
- SF-2833103: Remove XML/HTML tag on XML upload
- SF-2826651: Synonyms support
- SF-2828734: Add Spellchecker ability
- SF-2812808: Management of several indices
- SF-2818592: Unique field and default field set up
- SF-2818075: OpenDocument Format support
- SF-2816562: New data directory in package
- SF-2816561: Upgrade documentation for 1.1
- SF-2813996: Add schema tab panel to manager analyzer and fields
Bug fixes
- SF-2882223: Error when inserting with concurrent access
- SF-2881248: Web crawler does not follow redirection for robots.txt
- SF-2881227: Wrong encoding detection on web crawler
- SF-2881141: Can add fields in the schema for any type of index
- SF-2880577: Web crawler doesn't follow redirections correctly
- SF-2873319: Facets pagination in the interface doesn't work
- SF-2872576: Add a web pattern api
- SF-2863639: Prefix query changes term to lowercase
- SF-2859658: Tab Crawler / Web doesn't change when switching index
- SF-2856869: NPE Exception when field query is empty
- SF-2855999: Web page title parsing issue
- SF-2849135: File attribute loss in synonym filter and stop filter
- SF-2843033: Failure to create and use an index using the empty template
- SF-2841058: XML Conf: Exception with query template field sorting
- SF-2839468: XML Conf: Exception when using SizeFragmenter and restart
- SF-2832683: Exception when loading synonyms
- SF-2829664: Snippets and Facets not saved on the FS
- SF-2826959: Unable to set the langage in update form
- SF-2824320: Unable to update index after a field update
- SF-2823857: Search API, collapse.mode=full creates NullPointerException
- SF-2823851: setclasspath.bat: blocking problem with some checks
- SF-2820276: Add button is inactive on update form
OpenSearchServer 1.0 beta 2
New features
- SF-2812149: Updated Zkoss to version 3.6.2
- SF-2811901: Added collapsing for all documents to be returned
- SF-2808582: Added paging for query results
- SF-2807080: Updated Apache Tomcat to version 6.0.20
Bug fixes
- SF-2811902: Content type is now provided in XML API responses
- SF-2810881: Useless empty lines within XML results deleted
- SF-2808554: Collapsing now returns as many rows as expected
- SF-2805843: start.bat and stop.bat no longer require CATALINA_HOME
Documentation updates
- SF-2805848: Added JAVA_HOME documentation