Merge branch 'develop' of https://github.com/GAA-UAM/scikit-fda.git into develop
GAA-UAM · Oct 1, 2019 · 1ec0452
2 parents 1916415 + c448f3d
commit 1ec0452
Showing 13 changed files with 635 additions and 38 deletions.
diff --git a/README.rst b/README.rst
@@ -44,22 +44,18 @@ Installation from source
 It is possible to install the latest version of the package, available in the
 develop branch,  by cloning this repository and doing a manual installation.
 
-.. code::
+.. code:: bash
 
     git clone https://github.com/GAA-UAM/scikit-fda.git
-    cd scikit-fda/
-    pip install -r requirements.txt    # Install dependencies
-    python setup.py install
+    pip install ./scikit-fda
 
 Make sure that your default Python version is currently supported, or change
 the python and pip commands by specifying a version, such as ``python3.6``:
 
-.. code::
+.. code:: bash
 
     git clone https://github.com/GAA-UAM/scikit-fda.git
-    cd scikit-fda/
-    python3.6 -m pip install -r requirements.txt    # Install dependencies
-    python3.6 setup.py install
+    python3.6 -m pip install ./scikit-fda
 
 Requirements
 ------------
@@ -88,11 +84,11 @@ The people involved at some point in the development of the package can be
 found in the `contributors
 file <https://github.com/GAA-UAM/scikit-fda/blob/develop/THANKS.txt>`_.
 
-Citation
-========
-If you find this project useful, please cite:
+.. Citation
+   ========
+   If you find this project useful, please cite:
 
-.. todo:: Include citation to scikit-fda paper.
+   .. todo:: Include citation to scikit-fda paper.
 
 License
 =======

diff --git a/docs/conf.py b/docs/conf.py
@@ -17,9 +17,6 @@
 # add these directories to sys.path here. If the directory is relative to the
 # documentation root, use os.path.abspath to make it absolute, like shown here.
 #
-# import os
-# import sys
-# sys.path.insert(0, '/home/miguel/Desktop/fda/fda')
 
 import os
 import sys
@@ -79,7 +76,8 @@
 
 # General information about the project.
 project = 'scikit-fda'
-copyright = '2017, Author'
+copyright = ('2019, Grupo de Aprendizaje Automático - ' +
+             'Universidad Autónoma de Madrid')
 author = 'Author'
 
 # The language for content autogenerated by Sphinx. Refer to documentation

diff --git a/docs/index.rst b/docs/index.rst
@@ -5,18 +5,78 @@
 Welcome to scikit-fda's documentation!
 ======================================
 
+This package offers classes, methods and functions to give support to
+Functional Data Analysis in Python. Includes a wide range of utils to work with
+functional data, and its representation, exploratory analysis, or
+preprocessing, among other tasks such as inference, classification, regression
+or clustering of functional data.
+
+In the `project page <https://github.com/GAA-UAM/scikit-fda>`_ hosted by
+Github you can find more information related to the development of the package.
+
+
 .. toctree::
-   :includehidden:
-   :maxdepth: 4
+   :maxdepth: 2
    :caption: Contents:
    :titlesonly:
 
    apilist
+
+
+.. toctree::
+   :maxdepth: 1
+   :titlesonly:
+
    auto_examples/index
 
-Indices and tables
-==================
+An exhaustive list of all the contents of the package can be found in the
+:ref:`genindex`.
+
+Installation
+------------
+
+Currently, scikit-fda is available in Python 3.6 and 3.7, regardless of the
+platform. The stable version can be installed via
+`PyPI <https://pypi.org/project/scikit-fda/>`_:
+
+.. code-block:: bash
+
+   pip install scikit-fda
+
+
+It is possible to install the latest version of the package, available in
+the develop branch, by cloning this repository and doing a manual installation.
+
+.. code-block:: bash
+
+   git clone https://github.com/GAA-UAM/scikit-fda.git
+   pip install ./scikit-fda
+
+
+In this type of installation make sure that your default Python version is
+currently supported, or change the python and pip commands by specifying a
+version, such as python3.6.
+
+
+Contributions
+-------------
+
+All contributions are welcome. You can help this project grow in multiple ways,
+from creating an issue, reporting an improvement or a bug, to doing a
+repository fork and creating a pull request to the development branch.
+The people involved at some point in the development of the package can be
+found in the `contributors file
+<https://github.com/GAA-UAM/scikit-fda/blob/develop/THANKS.txt>`_.
+
+.. Citation
+   --------
+   If you find this project useful, please cite:
+
+   .. todo:: Include citation to scikit-fda paper.
+
+License
+-------
 
-* :ref:`genindex`
-* :ref:`modindex`
-* :ref:`search`
+The package is licensed under the BSD 3-Clause License. A copy of the
+`license <https://github.com/GAA-UAM/scikit-fda/blob/develop/LICENSE.txt>`_
+can be found along with the code or in the project page.
diff --git a/docs/modules/datasets.rst b/docs/modules/datasets.rst
@@ -17,6 +17,7 @@ The following functions are used to retrieve specific functional datasets:
    skfda.datasets.fetch_medflies
    skfda.datasets.fetch_weather
    skfda.datasets.fetch_aemet
+   skfda.datasets.fetch_octane
 
 Those functions return a dictionary with at least a "data" field containing the
 instance data, and a "target" field containing the class labels or regression values,

diff --git a/docs/modules/exploratory/outliers.rst b/docs/modules/exploratory/outliers.rst
@@ -4,12 +4,15 @@ Outlier detection
 Functional outlier detection is the identification of functions that do not seem to behave like the others in the
 dataset. There are several ways in which a function may be different from the others. For example, a function may
 have a different shape than the others, or its values could be more extreme. Thus, outlyingness is difficult to
-categorize exactly as each outlier detection method looks at different features of the functions in order to 
+categorize exactly as each outlier detection method looks at different features of the functions in order to
 identify the outliers.
 
 Each of the outlier detection methods in scikit-fda has the same API as the outlier detection methods of
 `scikit-learn <https://scikit-learn.org/stable/modules/outlier_detection.html>`_.
 
+Interquartilic Range Outlier Detector
+------------------------------------
+
 One of the most common ways of outlier detection is given by the functional data boxplot. An observation is marked
 as an outlier if it has points :math:`1.5 \cdot IQR` times outside the region containing the deepest 50% of the curves
 (the central region), where :math:`IQR` is the interquartilic range.
@@ -18,19 +21,23 @@ as an outlier if it has points :math:`1.5 \cdot IQR` times outside the region co
    :toctree: autosummary
 
    skfda.exploratory.outliers.IQROutlierDetector
-
+
+
+DirectionalOutlierDetector
+--------------------------
+
 Other more novel way of outlier detection takes into account the magnitude and shape of the curves. Curves which have
 a very different shape or magnitude are considered outliers.
 
 .. autosummary::
    :toctree: autosummary
 
    skfda.exploratory.outliers.DirectionalOutlierDetector
-   
+
 For this method, it is necessary to compute the mean and variation of the directional outlyingness, which can be done
 with the following function.
 
 .. autosummary::
    :toctree: autosummary
 
-   skfda.exploratory.outliers.directional_outlyingness_stats
+   skfda.exploratory.outliers.directional_outlyingness_stats
diff --git a/skfda/_neighbors/base.py b/skfda/_neighbors/base.py
@@ -97,11 +97,11 @@ def multivariate_metric(x, y, _check=False, **kwargs):
 class NeighborsBase(ABC, BaseEstimator):
     """Base class for nearest neighbors estimators."""
 
-    @abstractmethod
     def __init__(self, n_neighbors=None, radius=None,
                  weights='uniform', algorithm='auto',
                  leaf_size=30, metric='l2', metric_params=None,
                  n_jobs=None, multivariate_metric=False):
+        """Initializes the nearest neighbors estimator"""
 
         self.n_neighbors = n_neighbors
         self.radius = radius
@@ -166,6 +166,7 @@ def fit(self, X, y=None):
                     metric = lp_distance
                 else:
                     metric = self.metric
+
                 sklearn_metric = _to_multivariate_metric(metric,
                                                          self._sample_points)
             else:
@@ -203,7 +204,7 @@ def kneighbors(self, X=None, n_neighbors=None, return_distance=True):
                 Indices of the nearest points in the population matrix.
 
         Examples:
-            Firstly, we will create a toy dataset with 2 classes
+            Firstly, we will create a toy dataset.
 
             >>> from skfda.datasets import make_sinusoidal_process
             >>> fd1 = make_sinusoidal_process(phase_std=.25, random_state=0)
@@ -260,7 +261,7 @@ def kneighbors_graph(self, X=None, n_neighbors=None, mode='connectivity'):
             A[i, j] is assigned the weight of edge that connects i to j.
 
         Examples:
-            Firstly, we will create a toy dataset with 2 classes.
+            Firstly, we will create a toy dataset.
 
             >>> from skfda.datasets import make_sinusoidal_process
             >>> fd1 = make_sinusoidal_process(phase_std=.25, random_state=0)
@@ -329,7 +330,7 @@ def radius_neighbors(self, X=None, radius=None, return_distance=True):
                 within a ball of size ``radius`` around the query points.
 
         Examples:
-            Firstly, we will create a toy dataset with 2 classes.
+            Firstly, we will create a toy dataset.
 
             >>> from skfda.datasets import make_sinusoidal_process
             >>> fd1 = make_sinusoidal_process(phase_std=.25, random_state=0)

diff --git a/skfda/_neighbors/classification.py b/skfda/_neighbors/classification.py
@@ -59,8 +59,9 @@ class KNeighborsClassifier(NeighborsBase, NeighborsMixin, KNeighborsMixin,
         Doesn't affect :meth:`fit` method.
     multivariate_metric : boolean, optional (default = False)
         Indicates if the metric used is a sklearn distance between vectors (see
-        :class:`sklearn.neighbors.DistanceMetric`) or a functional metric of
-        the module :mod:`skfda.misc.metrics`.
+        :class:`~sklearn.neighbors.DistanceMetric`) or a functional metric of
+        the module `skfda.misc.metrics` if ``False``.
+
     Examples
     --------
     Firstly, we will create a toy dataset with 2 classes
@@ -96,6 +97,7 @@ class KNeighborsClassifier(NeighborsBase, NeighborsMixin, KNeighborsMixin,
     :class:`~skfda.ml.regression.KNeighborsRegressor`
     :class:`~skfda.ml.regression.RadiusNeighborsRegressor`
     :class:`~skfda.ml.clustering.NearestNeighbors`
+    
 
     Notes
     -----
@@ -254,6 +256,7 @@ class RadiusNeighborsClassifier(NeighborsBase, NeighborsMixin,
     :class:`~skfda.ml.regression.RadiusNeighborsRegressor`
     :class:`~skfda.ml.clustering.NearestNeighbors`
 
+
     Notes
     -----
     See Nearest Neighbors in the sklearn online documentation for a discussion
@@ -358,6 +361,7 @@ class and return a :class:`FData` object with only one sample
     :class:`~skfda.ml.regression.RadiusNeighborsRegressor`
     :class:`~skfda.ml.clustering.NearestNeighbors`
 
+
     """
 
     def __init__(self, metric='l2', mean='mean'):