Releases: CentreForDigitalHumanities/I-analyzer
5.6.1
What's Changed
Documentation
- Added documentation about loading corpora before indexing (#1555)
- Added documentation on Django project settings (#1530)
Bugfixes
- Fixed an issue where field coverage did not load for anonymous users (#1552)
Preparation for planned features
- Added module to parse a JSON corpus definition (#1519)
- Added module to get the source data reader corpora without a Python definition (#1541)
Changes in server configuration
The data_directory
for a corpus is now validated when the corpus is loaded into the database. It must either be empty, or an existing directory. If the server configuration specifies nonexistent directories, they should be removed.
Contributors: @JeltevanBoheemen, @tijmenbaarda, @lukavdplas
Full Changelog: 5.6.0...5.6.1
5.6.0
What's Changed
New features
- Users can now access I-analyzer without logging in (#1484, #1498, #1539)
- The site footer now displays the version of I-analyzer (#1479, #1492)
- Added new options for corpus documentation: citation, licence, and terms of service (#1488)
- Added a tag overview menu (#1463)
- Updated the layout of document previews in search results (#1476)
Accessibility
- Standardised keyboard controls for dropdown menus (#1465)
- Set the
lang
property of a field's content based on the corpus definition (#1335)
Updates to corpora
- Various corpora: renamed some fields named "URL" for clarity (#1491)
- People & Parliament Ireland: enable adding word2vec models (#1490)
- Jewish Migration: allow loading local source data (#1510)
- DBNL: fixed the chapter order when opening "view book" (#1503)
- Updated images for several corpora (#1485)
Bug fixes
- Fixed paragraph breaks not showing up in the document view (#1477, #1511)
- Fixes to controls for viewing scanned documents (#1493)
- Fixed rendering of geopoint array (#1444)
- Fixed the layout of barchart options menu on mobile devices (#1487)
Code quality
- Avoid repeated downloading of NLTK stopwords (#1481)
- Update elasticsearch in docker configuration (#1482)
- Refactored visualisation selection menu (#1467)
- Added backend dependence on elasticsearch service in docker configuration (#1471)
- Refactored wordcloud component (#1475)
- Forbid 'scan' and 'tab-scan' field names in corpus definitions (#1496)
- Updated documentation about Python versions (#1499)
- Default sorting is now a property of a Corpus object, rather than its fields (#1501)
- Updated the file structure in the
addcorpus
module (#1502) - Improved corpus validation: distinguish validation at indexing and publishing stage (#1504)
- Retrieve wordcloud stopwords from the field language, rather than corpus settings (#1514)
- Allow checking whether a corpus has a Python definition (#1515)
- Corpus images and documentation are now saved in the database (#1526, #1531)
- Added workflow for updating CITATION.cff (#1534)
Preparations for planned features
These changes add functionality for planned features.
Changes in server configuration
These changes are required when updating a server to this version:
In the backend settings.py
:
- configure the
MEDIA_ROOT
setting. Corpus images will be saved here.
In the frontend environment.ts
:
- add
directDownloadLimit
property (set to1000
to keep the same behaviour) - add
version
andsourceUrl
properties. See the development environment for an example.
Contributors: @BeritJanssen, @JeltevanBoheemen, @ar-jan, @lukavdplas
Full Changelog: 5.5.1...5.6.0
5.5.1
What's Changed
- Fixed: page controls did not work in some corpora #1523
Full Changelog: 5.5.0...5.5.1
5.5.0
What's Changed
- Implement routing in PageResults by @lukavdplas in #1405
- Feature/cancel tasks by @BeritJanssen in #1387
- Improved routing in QueryModel by @lukavdplas in #1406
- lazy load document tags by @lukavdplas in #1402
- versioning docs by @lukavdplas in #1448
- Bump django from 4.2.7 to 4.2.10 in /backend by @dependabot in #1411
- Feature/integrate ianalyzer readers by @lukavdplas in #1449
- Update Docker Node version from 14 to 18 by @ar-jan in #1451
- Bugfix/parse weird doc ids by @lukavdplas in #1450
- Bugfix/vis spinner by @BeritJanssen in #1458
- Feature/debugging convenience by @lukavdplas in #1447
- Prrevent navbar from being obscured by search bar by @JeltevanBoheemen in #1462
- Base DownloadOptionsComponent loading spinner on parent DownloadComponent by @JeltevanBoheemen in #1461
- Add size parameter only once on download by @JeltevanBoheemen in #1460
Full Changelog: 5.4.0...5.5.0
5.4.0
What's Changed
New features
- Added the option for fields with a "date range" datatype (#1357)
- Filter search results based on the tags you assigned to documents (#1393)
- Added support for two new file types in corpus source files: XLSX (#1348) and JSON (#1392)
New corpora
- Added corpus of course descriptions from the Humanities faculty (#1348)
- Added Jewish migration corpora #1392
Usability
- More consistent icons in the interface #1361
- Improved accessibility for the slider control of the range filter (#1334)
Bug fixes
- Fixed an issue that where tests for PEACE portal corpora could fail due to the order in which files are read (#1350)
- Fixed an issue when searching word models for a word that isn't in the model (#1360)
- Fixed an issue where word model graphs would not show colours in the legend (#1358)
- Fixed an issue where the time interval slider for word model graphs would not reset properly (#1372)
- Fixed various issues that stopped scan images from being displayed (#1370)
- Fixed an issue where the page would scroll up after changing options in the interface (#1373)
- Fixed the display of long queries in the search history (#1375)
- Fixed the query help menu popping up unexpectedly, missing navigation in the manual, and the download menu for scans not closing properly (#1389)
- Fixed the "query in context" field not being shown when configuring a download of search results (#1398)
Code quality
- Added functional test setup (#1239)
- Formatting in the frontend code (#1356)
- Fixed a lifecycle issue for the visualisation component (#1390)
- Docker setup does not update python dependencies in the source code (#1386)
- Separated frontend and backend tests in Github actions (#1399)
- Added base class definitions for improved state management in the frontend (#1404)
- Adjusted yarn commands (#1407)
Contributors: @BeritJanssen , @JeltevanBoheemen , @Meesch , @lukavdplas
Full Changelog: 5.3.0...5.4.0
5.4.0 (pre-release)
5.3.0
What's Changed
New features
- Users can now add tags to documents (#1199, #1332)
- Restructured the site header and improved its accessiblity to keyboard and screen reader users (#1325, #1327)
- When viewing documents, hover over a field name to see its description (#1341)
New corpora
Bug fixes
- Fixed an issue where the search history would show up as a blank page (#1338)
Github & documentation
- Updated the README and moved developer instructions to
/documentation
(#1326) - Changed the bug report issue template to a form (#1302)
Code quality
- Saving search history is now managed by the backend (#1306)
- Refactored the representation of search results in the frontend (#1307)
Full Changelog: 5.2.0...5.3.0
5.2.0
What's Changed
New features
- Wordcloud shows tooltips (#1296)
Bug fixes
- Fixed corpus permissions for SAML users (#1298)
- Fixed visualisation functions when date field is not called
date
(#1299)
Full Changelog: v5.1.0...5.2.0
v5.1.0
What's Changed
- Add issue templates (#1293)
- Search history settings (ttps://github.com//pull/1201)
- Change naming convention of CSV filenames (#1283)
- Add missing text image tabs (#1300)
- Update corpus settings (#1290)
- Allow download full data for ngram visualization (#1289)
Full Changelog: 5.0.1...v5.1.0
5.0.1
Fix SAML login error by reenabling djangosaml2 middleware.
Full Changelog: v5.0.0...5.0.1