Releases: diegozea/MIToS.jl
v3.0.6
MIToS v3.0.6
v3.0.5
MIToS v3.0.5
v3.0.4
MIToS v3.0.4
v3.0.3
MIToS v3.0.3
v3.0.2
MIToS v3.0.2
v3.0.1
v3.0.0
MIToS v3.0.0
MIToS v3.0.0 requires Julia v1.9 or higher, dropping support for older versions. This
release introduces several breaking changes to improve the usability of the package.
When possible, deprecation warnings are used to inform you of the changes.
MIToS.MSA
The MSA module now includes ways to read, write, and work with unaligned protein sequences:
-
The
MSA
module now exports theAnnotatedSequence
type to represent a single protein
sequence with annotations. This type is a subtype of the newAbstractSequence
type,
a subtype of the newAbstractResidueMatrix
type. -
The
MSA
module now exports thesequence_id
function to get the identifier of a
sequence object. -
The
MSA
module now defines theFASTASequences
,PIRSequences
, andRawSequences
file formats to read and write (unaligned) protein sequences in FASTA, PIR, and raw
formats, respectively. -
[Breaking change] The behavior of the
getannotresidue
,getannotsequence
,
setannotresidue!
, andsetannotsequence!
functions have changed for sequences objects,
such asAnnotatedSequence
,AnnotatedAlignedSequence
, andAlignedSequence
. Now, these
functions take the feature name, rather than the sequence name, as the second
positional argument. As an example of migration,
getannotsequence(sequence, "sequence_name", "feature_name")
should be replaced by
getannotsequence(sequence, "feature_name")
. You still need to specify the sequence name
when working with MSA objects.
Other changes in the MSA module are:
-
[Breaking change] The
join
function forAnnotatedMultipleSequenceAlignment
objects
is deprecated in favor of thejoin_msas
function. -
[Breaking change] The
Clusters
type is no longer a subtype ofClusteringResult
from
theClustering.jl
package. Instead, theClusters
type is now a subtype of the new
AbstractCluster
type. Support for theClustering.jl
interface is still available
through package extensions. You now need to load theClustering.jl
package to use the
assignments
,nclusters
, andcounts
functions.
MIToS.PDB
The PDB module now depends on the BioStructures
package. The main changes in the PDB
module are:
-
The
PDB
module now exports theMMCIFFile
file format to read and write PDB files in
the mmCIF format (usingBioStructures
under the hood). -
[Breaking change] The
download_alphafold_structure
function can now download the
predicted structures from the AlphaFold Protein Structure Database using the mmCIF
format (format=MMCIFFile
). This is the new default format. Therefore, you should use
format=PDBFile
to get a PDB file like before. For example,
download_alphafold_structure("P00520")
in previous versions is the same as
download_alphafold_structure("P00520", format=PDBFile)
in this version. -
[Breaking change] The
downloadpdb
function now returns a mmCIF file by default.
Therefore, you should useformat=PDBML
to get a PDBML file. As an example of migration,
downloadpdb("1IVO")
should be replaced bydownloadpdb("1IVO", format=PDBML)
, unless
you want to get a mmCIF file. -
[Breaking change] The
PDBAtom
type now adds two extra fields:alt_id
andcharge
to represent the alternative location indicator and the atom's charge, respectively.
This improves the compatibility with the mmCIF format and theBioStructures
package. -
[Breaking change] The
query_alphafolddb
function now returns the EntrySummary object
of the returned JSON response instead of the Root list. Therefore, there is no need to
take the first element of the list to get the required information. For example,
query_alphafolddb("P00520")[1]["uniprotId"]
would be replaced by
query_alphafolddb("P00520")["uniprotId"]
.
MIToS.Utils.Scripts
- [Breaking change] The
MIToS.Utils.Scripts
module and the MIToS scripts have been
moved to their package at MIToS_Scripts.jl.
Therefore, theMIToS.Utils.Scripts
module is no longer exported. This allows for a
reduction in the number of MIToS dependencies, and improved load time.
v2.22.0
MIToS v2.22.0
This versions introduces several breaking changes to improve the usability of the
Information
module. The main changes are:
-
[Breaking change] The
Information
module deprecates theCounts
type in favor of
the newFrequencies
type. The new type as the same signature and behavior as the old one. -
[Breaking change] The
count
function on sequences has been deprecated in favor of the
frequencies
function, which has the same signature and behavior as the old one. -
[Breaking change] The
count!
function is deprecated in favor offrequencies!
.
The new function use keyword arguments to define the weights and pseudocounts. As an
example of migration,count!(table, weights, pseudocounts, seqs...)
should be replaced
byfrequencies!(table, seqs..., weights=weights, pseudocounts=pseudocounts)
. -
[Breaking change] The
probabilities!
method using positional arguments for the
weights, pseudocounts and pseudofrequencies is deprecated in favor the one that uses
keyword arguments. As an example of migration,
probabilities!(table, weights, pseudocounts, pseudofrequencies, seqs...)
should be replaced by
probabilities!(table, seqs..., weights=weights, pseudocounts=pseudocounts, pseudofrequencies=pseudofrequencies)
. -
[Breaking change] The
Information
has deprecated theentropy
method on
Frequencies
andProbabilities
in favor of theshannon_entropy
function. The
definition of the base is now done using thebase
keyword argument. As an example of
migration,entropy(p, 2)
should be replaced byshannon_entropy(p, base=2)
. -
[Breaking change] The
marginal_entropy
methods based on positional arguments are
deprecated in favor of a method relying on themargin
andbase
keyword arguments.
As an example of migration,marginal_entropy(p, 2, 2.0)
should be replaced by
marginal_entropy(p, margin=2, base=2.0)
. -
[Breaking change] The
mutual_information
method based on positional arguments is
deprecated in favor of a method relying on thebase
keyword argument. As an example of
migration,mutual_information(p, 2)
should be replaced bymutual_information(p, base=2)
. -
[Breaking change] The
mapcolpairfreq!
andmapseqpairfreq!
functions now uses the
booleanusediagonal
keyword argument to indicate if the function should be applied to
the diagonal elements of the matrix (the default istrue
). Before, this was done passing
Val{true}
orVal{false}
as the last positional argument. -
The
mapcolfreq!
,mapseqfreq!
,mapcolpairfreq!
, andmapseqpairfreq!
methods using
keyword arguments, now pass the extra keyword arguments to the mapped function. -
The
Information
module now exports themapfreq
function that offers a more high-level
interface to themapcolfreq!
,mapseqfreq!
,mapcolpairfreq!
, andmapseqpairfreq!
functions. This function allows the user to map a function to the residue frequencies or
probabilities of the columns or sequences of an MSA. Whenrank = 2
, the function is
applied to pairs of sequences or columns. -
The
Information
module now exports methods of theshannon_entropy
,kullback_leibler
,
mutual_information
, andnormalized_mutual_information
functions that take an
AbstractArray{Residue}
as input, e.g. an MSA. Those methods use themapfreq
function
under the hood to ease the calculation of the information measures on MSAs. -
The
frequencies!
,frequencies
,probabilities!
, andprobabilities
functions now
accept arrays ofResidue
s of any dimension. Therefore, there is no need to use the
vec
function to convert the arrays to vectors. -
The
MSA
module now exports theWeightType
union type to representweights
.