Seqan compat/seqan2 char alphabet #12

rrahn · 2020-09-17T13:10:33Z

I looked into the PR for the multiple alignment, and it is already quite nice.
The alphabet conversions still scared me a little, though, since they would undermine the generic code :).
So Marcel and I tested whether it is possible to use type erasure over char to run the multiple sequence alignment, and in fact it works pretty well. That also means transferring these parts to seqan3 becomes much easier, because a lot of the alphabet-dependent boilerplate in SeqAn can be removed.

I made a PR for the changes, since I was playing with the code anyway. The only other thing I did was make the global interface independent of the default MSA config, since it is basically not necessary: all defaults are set internally.
The rest is part of the review. I hope this procedure is OK with you; I was afraid that explaining everything in GitHub comments would have gotten out of hand :).

I think you should rebase on master before looking into it. The actual change is only the last commit, though.

Selects the correct simd scoring scheme based on the given matrix type. For the moment, to test and implement the features of protein simd alignment, we can use this simple static differentiation. Later this will be replaced by a dynamic dispatching mechanism.
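A minimal sketch of what such a static differentiation could look like, assuming the choice is made purely at compile time from the matrix type. All type names below are hypothetical stand-ins, not the actual seqan3 types:

```cpp
#include <type_traits>

// Hypothetical matrix tags; the real seqan3 matrix types are different.
struct toy_nucleotide_matrix {};
struct toy_amino_acid_matrix {};

// Hypothetical simd scoring schemes: one compares symbols directly,
// the other looks scores up in a substitution matrix.
struct toy_simd_simple_scheme { static constexpr bool uses_matrix_lookup = false; };
struct toy_simd_matrix_scheme { static constexpr bool uses_matrix_lookup = true; };

// Static differentiation: the scheme is picked by the compiler from the matrix type.
template <typename matrix_t>
using select_simd_scheme_t = std::conditional_t<std::is_same_v<matrix_t, toy_amino_acid_matrix>,
                                                toy_simd_matrix_scheme,
                                                toy_simd_simple_scheme>;

static_assert(select_simd_scheme_t<toy_amino_acid_matrix>::uses_matrix_lookup);
static_assert(!select_simd_scheme_t<toy_nucleotide_matrix>::uses_matrix_lookup);
```

Because the selection happens in the type system, nothing is decided at run time, which is the part a later dynamic dispatch would replace.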

Makes a test template for the alignment benchmark so we can reuse it for different benchmark settings.

When tracking the last cell of the banded column computation the matrix iterator was referring to the wrong alignment cell.

Before, the algorithm always tracked the last cell of the current column, irrespective of its position within the global matrix. This means that the optimum could point to a cell that does not represent the full sequence and hence would not be a valid semi-global alignment.
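The idea behind the fix can be sketched roughly as follows. This is not the seqan3 implementation; it assumes free trailing gaps in the second sequence only, so that valid end cells lie in the last row of the full matrix, and every name in it is made up for illustration:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <limits>

// Hypothetical record of the best end position seen so far.
struct optimum
{
    int score;
    std::size_t row;
    std::size_t column;
};

// Row (in the full matrix) of the last cell the band covers in `column`.
// `lower_extent` is how far the band reaches below the main diagonal.
inline std::size_t last_banded_row(std::size_t column, std::size_t lower_extent, std::size_t row_count)
{
    return std::min(column + lower_extent, row_count - 1);
}

// Track the last cell of a banded column only if it lies in the last row of the full matrix.
inline void track_last_cell(optimum & best, int score, std::size_t column,
                            std::size_t lower_extent, std::size_t row_count)
{
    std::size_t const row = last_banded_row(column, lower_extent, row_count);
    if (row + 1 != row_count)
        return; // the band ends above the last row: the first sequence is not fully consumed
    if (score > best.score)
        best = {score, row, column};
}
```

Without the row check, a high score in an early column whose band stops short of the last row would be reported as the optimum, which is the invalid case described above.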

…bility overhead. By applying this type erasure, we stay generic and allow any user input as long as it fulfils the respective concepts that are modeled explicitly for alphabets and scoring schemes. It is also an initial step towards reducing the boilerplate overloads in seqan2 when we start adapting the algorithms step by step.
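To illustrate what "type erasure over char" means here, a rough sketch under the assumption that every alphabet letter can be converted to a char. The toy alphabets and function names are hypothetical and only mimic the idea; they are not the seqan3 or seqan2 interfaces:

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical alphabets; the only requirement is a to_char() member.
struct toy_dna
{
    unsigned char rank; // 0..3
    char to_char() const { return "ACGT"[rank & 3]; }
};

struct toy_amino
{
    char symbol;
    char to_char() const { return symbol; }
};

// Erase the alphabet type: whatever the input alphabet is, the result is a char sequence.
template <typename sequence_t>
std::string erase_to_char(sequence_t const & sequence)
{
    std::string result;
    result.reserve(sequence.size());
    for (auto const & letter : sequence)
        result.push_back(letter.to_char());
    return result;
}

// A non-template stand-in for the algorithm: it is written once, for char only,
// so no alphabet-specific overloads are needed.
inline std::size_t count_matches(std::string const & lhs, std::string const & rhs)
{
    std::size_t matches = 0;
    for (std::size_t i = 0; i < lhs.size() && i < rhs.size(); ++i)
        matches += (lhs[i] == rhs[i]);
    return matches;
}
```

The generic part is confined to `erase_to_char`; everything behind it sees only `char`, which is why alphabet-dependent overloads become unnecessary.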

codecov-commenter · 2020-09-17T13:35:37Z

Codecov Report

Merging #12 into tcoffee will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff            @@
##           tcoffee      #12   +/-   ##
========================================
  Coverage    97.87%   97.88%           
========================================
  Files          269      269           
  Lines        10119    10146   +27     
========================================
+ Hits          9904     9931   +27     
  Misses         215      215

Impacted Files	Coverage Δ
...clude/seqan3/alignment/multiple/align_multiple.hpp	`100.00% <100.00%> (ø)`
...ltiple/detail/align_multiple_seqan2_adaptation.hpp	`100.00% <100.00%> (ø)`
...qan3/alignment/pairwise/alignment_configurator.hpp	`100.00% <100.00%> (ø)`
...ise/detail/pairwise_alignment_algorithm_banded.hpp	`100.00% <100.00%> (ø)`
...ignment/pairwise/detail/policy_optimum_tracker.hpp	`97.29% <100.00%> (+1.00%)`	⬆️
include/seqan3/search/search.hpp	`100.00% <100.00%> (ø)`
include/seqan3/io/stream/iterator.hpp	`98.52% <0.00%> (-0.03%)`	⬇️
include/seqan3/search/search_result.hpp	`100.00% <0.00%> (ø)`
...de/seqan3/argument_parser/detail/version_check.hpp	`92.74% <0.00%> (ø)`
...ment/scoring/detail/simd_matrix_scoring_scheme.hpp	`100.00% <0.00%> (ø)`
... and 2 more

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update cfbd2b9...2aff6ec.

marehr and others added 25 commits September 3, 2020 12:20

[INFRA] seqan3/std/* header files MUST NOT include any seqan3 header

95e3421

Merge pull request seqan#2093 from marehr/std_platfrom_not_present

e211e6d

[INFRA] seqan3/std/* header files MUST NOT include any seqan3 header

[MISC] Update the simd benchmark to use correct alignment output conf…

758fd29

…iguration.

[TEST] Add test for aa27 in global affine alignment.

80b2dc4

[TEST] Add initial microbenchmark for simd protein alignment.

e0472fa

[MISC] Refactor global alignment simd benchmark.

98f9ae1

Makes a test template for the alignment benchmark so we can reuse it for different benchmark settings.

[TEST] Move the simd protein benchmark to the new test template.

96af05c

[FIX] Track correct coordinate in banded alignment.

4eec66a

When tracking the last cell of the banded column computation the matrix iterator was referring to the wrong alignment cell.

[MISC] Refactor semi-global banded alignment test configs.

13f00dd

[TEST] More tests for semi-global banded alignment.

a0cfcc7

Merge pull request seqan#2106 from rrahn/optimise_alignment/part6_ena…

2a40134

…ble_banded_end_position Optimise alignment/part6 enable banded end position

Merge pull request seqan#2108 from rrahn/protein_alignment/part1_add_…

f50d9e9

…microbenchmarks Protein alignment/part1 add microbenchmarks

[FIX] Timeout in debug nightlies

7ad19a7

[DOC] Update cppreference index

5ec8be7

Merge pull request seqan#2114 from marehr/update_dox

16ad3f8

[DOC] Update cppreference index

[FIX] Wrong ranks in search algorithm

e613754

[TEST] add debug_stream test for search

93efca7

Merge pull request seqan#2109 from eseiler/fix/timeout

b0e2099

[FIX] Timeout in debug nightlies

Merge pull request seqan#2116 from eseiler/fix/ranks

60201f8

[FIX] Wrong ranks in search algorithm

Merge remote-tracking branch 'upstream/release-3.0.2' into week05

0aa8f9f

Merge pull request seqan#2117 from marehr/week05

d62f6ad

pull changes from release-3.0.2 into master - progress of week 05

Merge branch 'tcoffee' of https://github.com/smehringer/seqan3 into s…

593186f

…eqan_compat/seqan2_char_alphabet
