Use the doctest module in get_example_data #308

asmeurer · 2023-10-20T21:59:07Z

Fixes #282

Still several todos here:

Clean up code (probably don't need to define the class inside of the function)
Add tests
Check that the generated JSON is as desired (the execution status is not actually included. I'm not clear why)
Make it so that any doctest anywhere is run, not just ones in "Examples" (this doesn't necessarily need to be done in this PR)
Allow libraries to configure doctest options (like ELLIPSIS)

Here's an example:

def docstring(x):
    """
    Examples
    ========

    >>> from test_mod import docstring
    >>> a = docstring(1)
    >>> a
    2

    >>> 1 + a
    3

    >>> import matplotlib.pyplot as plt
    >>> plt.plot([0, 1], [0, 1])
    >>> plt.show()

    >>> 1 + 1
    2

    >>> syntax error

    >>> 1/0 # exception
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    ZeroDivisionError: division by zero

    >>> 1/0 # unexpected exception
    """
    return x + 1

__version__ = '0'

[global]
module = 'test_mod'

Generates

  "example_section_data": {
    "children": [
      {
        "type": "code",
        "value": "from test_mod import docstring\n"
      },
      {
        "type": "code",
        "value": "a = docstring(1)\n"
      },
      {
        "type": "code",
        "value": "a\n"
      },
      {
        "type": "code",
        "value": "1 + a\n"
      },
      {
        "type": "code",
        "value": "import matplotlib.pyplot as plt\n"
      },
      {
        "type": "code",
        "value": "plt.plot([0, 1], [0, 1])\n"
      },
      {
        "type": "code",
        "value": "plt.show()\n"
      },
      {
        "type": "Fig",
        "value": {
          "kind": "assets",
          "module": "test_mod",
          "path": "fig-test_mod:docstring-0-c8430bd5.png",
          "type": "RefInfo",
          "version": "0"
        }
      },
      {
        "type": "code",
        "value": "1 + 1\n"
      },
      {
        "type": "code",
        "value": "syntax error\n"
      },
      {
        "type": "code",
        "value": "1/0 # exception\n"
      },
      {
        "type": "code",
        "value": "1/0 # unexpected exception\n"
      }
    ],

They aren't actually that important for doctests because they are only used for the reporting, which we are bypassing anyways.

asmeurer · 2023-11-11T01:10:48Z

I think the main thing that needs to be done here is now is to take a look at the generated JSON and see if we like how it looks. It's not hard to change what is there.

Doesn't do anything right now because success/failure isn't saved

Carreau · 2023-11-16T17:20:40Z

To do for me:

Check why the exec_status is in the JSON
Check wether the output of doctests is in the Json.
Add the tests examples.

asmeurer · 2023-11-16T17:34:46Z

You should also just review the code, and run this against the existing example configurations to make sure nothing funny is happening.

Carreau · 2023-11-17T10:15:55Z

I pushed a commit that inject a debug function instead of lambda s: None,

It seem that some of the parsing is incorrect, as I get an

$ papyri gen examples/papyri.toml --only papyri.examples:example1
...
Unexpected exception (<class 'SyntaxError'>, SyntaxError('multiple statements found while compiling a single statement', ('<doctest example1[0]>', 1, 32, 'import matplotlib.pyplot as plt\n', 1, 32)), <traceback object at 0x1202d7a40>)

Note that this debug message make it looks like the exec(compile(..., 'single')) in doctest got a line with \n et the end, but it does get a multiple line.

I'm not sure why this is happening or why the code here is wrong. I'll investigate.

Carreau · 2023-11-17T10:30:38Z

Ha, I think it considers ... as continuation always. So replacing ... with >>> in a couple of places works.

And that make me realize we should have a custom parser in IPython/testing/plugin/ipdoctest.py

They aren't actually that important for doctests because they are only used for the reporting, which we are bypassing anyways.

Doesn't do anything right now because success/failure isn't saved

asmeurer · 2023-12-01T22:16:18Z

In the future, do not force push to other people's branches.

papyri/gen.py

asmeurer · 2023-12-15T21:42:14Z

There seems to be a segfault from one of the plots in the np.sinc doctest

Fatal Python error: Segmentation fault

Current thread 0x00000002016d0240 (most recent call first):
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/numpy/core/fromnumeric.py", line 43 in _wrapit
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/numpy/core/fromnumeric.py", line 54 in _wrapfunc
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/numpy/core/fromnumeric.py", line 2597 in cumsum
  File "<__array_function__ internals>", line 200 in cumsum
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/gridspec.py", line 193 in get_grid_positions
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/_api/deprecation.py", line 384 in wrapper
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/gridspec.py", line 665 in get_position
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/axes/_base.py", line 793 in set_subplotspec
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/axes/_base.py", line 661 in __init__
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/figure.py", line 757 in add_subplot
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/figure.py", line 1628 in gca
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/pyplot.py", line 2309 in gca
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/matplotlib/pyplot.py", line 3084 in title
  File "<doctest sinc[1]>", line 1 in <module>
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/doctest.py", line 1351 in __run
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/doctest.py", line 1498 in run
  File "/Users/aaronmeurer/Documents/papyri/papyri/gen.py", line 1335 in get_example_data
  File "/Users/aaronmeurer/Documents/papyri/papyri/gen.py", line 1655 in prepare_doc_for_one_object
  File "/Users/aaronmeurer/Documents/papyri/papyri/gen.py", line 2129 in collect_api_docs
  File "/Users/aaronmeurer/Documents/papyri/papyri/gen.py", line 558 in gen_main
  File "/Users/aaronmeurer/Documents/papyri/papyri/__init__.py", line 474 in gen
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/typer/main.py", line 683 in wrapper
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/click/core.py", line 760 in invoke
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/click/core.py", line 1404 in invoke
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/click/core.py", line 1657 in invoke
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/typer/core.py", line 216 in _main
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/typer/core.py", line 778 in main
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/click/core.py", line 1130 in __call__
  File "/Users/aaronmeurer/anaconda3/envs/papyri/lib/python3.11/site-packages/typer/main.py", line 311 in __call__
  File "/Users/aaronmeurer/Documents/papyri/papyri/__main__.py", line 3 in <module>
  File "<frozen runpy>", line 88 in _run_code
  File "<frozen runpy>", line 198 in _run_module_as_main

Although there's a separate question which is why the doctests are being run at all with --no-exec.

asmeurer · 2023-12-15T21:50:13Z

I fixed the --no-exec flag. The segfault doesn't happen on main, though. I'm guessing it has something to do with with the fig managers.

Carreau · 2023-12-16T13:25:36Z

I think the culprit of set_numeric_ops (I've pushed a commit that deactivate it), which replace addition with addition mod 5 globally. It might still be a bug but as it's deprecated maybe it's not worth our time tracking it down.

I've pushed a commit that exclude just this function from being executed.

I was also able to reproduce just with

papyri gen examples/numpy.toml --no-narrative --only numpy:set_numeric_ops --only numpy:sinc

Carreau · 2023-12-17T09:58:54Z

papyri/gen.py

+            doctests = doctest.DocTestParser().get_doctest(
+                block, doctest_runner.globs, obj.__name__, filename, lineno
+            )


get_doctests also completely drop interleaving text, so we'll need to fallback to parse(...) and create the DocTests object ourselves here.

… blocks

This is more robust and correct than doing code.split('\n\n').

asmeurer · 2024-01-04T23:12:02Z

I've fixed the parser to be more robust for interleaving text. It now properly handles the case where text is right before an example. My main concerns now are if we are actually including everything we want in the output JSON.

In particular, the JSON output doesn't include the prompts (>>> and ...), and it doesn't include the outputs of the doctests. This is the case even when exec=false. We should presumably fix it to include the outputs, but note that this is also the case in main. For example, here's np.select in main:

  "example_section_data": {
    "children": [
      {
        "type": "code",
        "value": "x = np.arange(6)\ncondlist = [x<3, x>3]\nchoicelist = [x, x**2]\nnp.select(condlist, choicelist, 42)"
      },
      {
        "type": "code",
        "value": "condlist = [x<=4, x>3]\nchoicelist = [x, x**2]\nnp.select(condlist, choicelist, 55)"
      }
    ]

and in this branch

  "example_section_data": {
    "children": [
      {
        "type": "code",
        "value": "x = np.arange(6)\n"
      },
      {
        "type": "code",
        "value": "condlist = [x<3, x>3]\n"
      },
      {
        "type": "code",
        "value": "choicelist = [x, x**2]\n"
      },
      {
        "type": "code",
        "value": "np.select(condlist, choicelist, 42)\n"
      },
      {
        "type": "text",
        "value": "\n"
      },
      {
        "type": "code",
        "value": "condlist = [x<=4, x>3]\n"
      },
      {
        "type": "code",
        "value": "choicelist = [x, x**2]\n"
      },
      {
        "type": "code",
        "value": "np.select(condlist, choicelist, 55)\n"
      }
    ],

Compare the actual docstring:

    Examples
    --------
    >>> x = np.arange(6)
    >>> condlist = [x<3, x>3]
    >>> choicelist = [x, x**2]
    >>> np.select(condlist, choicelist, 42)
    array([ 0,  1,  2, 42, 16, 25])

    >>> condlist = [x<=4, x>3]
    >>> choicelist = [x, x**2]
    >>> np.select(condlist, choicelist, 55)
    array([ 0,  1,  2,  3,  4, 25])

Note the array([ 0, 1, 2, 3, 4, 25]) bits aren't included in the JSON anywhere.

Carreau · 2024-01-10T09:12:34Z

papyri/gen.py

+                    )
+                )
+            )
+        figs.extend(figs)


Suggested change

figs.extend(figs)

self.figs.extend(figs)

I think, or no figures will be saved I belive.

papyri/gen.py

Carreau · 2024-01-10T09:37:06Z

examples/dask.toml

@@ -41,3 +41,4 @@ exclude = [ "dask.utils:Dispatch",

 #docs_path = "~/dev/dask/docs/source"
 exec_failure = 'fallback'
+execute_doctests = false


You introduced this but are not using it, I'm guessing you intended something to exec. I pushed a commit that rename all the usage of config.exec to config.execute_doctests for it to work as intended. You naming is better.

Carreau · 2024-01-10T10:07:13Z

Ok, test are passing, let's merge and move on.

This is incomplete but I'm going to try to deal with the following: 1) each line in a black example is after jupyter#308 it's own line, so try to collapse subsequent code blocks. 2) It seem that we get a number of report_failure, but failure is just when the output does not match, though when we have a block with multiple >>> in sequence and we ignore output on purpose they now are seen as failure. It's not super great and will need a buch of workaround

asmeurer added 2 commits October 20, 2023 15:46

First pass at using the doctest module in get_example_data

7b6ec43

Fix matplotlib showing the plots instead of saving them in doctests

8e11f52

asmeurer marked this pull request as draft October 20, 2023 22:00

asmeurer added 4 commits November 10, 2023 16:13

Move the PapyriDocTestRunner class to the module level

3dd390b

Handle filename and lineno not being accessible

0bbbabe

They aren't actually that important for doctests because they are only used for the reporting, which we are bypassing anyways.

Fix a spelling error

d65f57c

Fix generation of doctest plot figures

471d729

asmeurer added 2 commits November 10, 2023 18:20

Remove unused variable

bbcb40e

Use ELLIPSIS option flag

29c66ac

Doesn't do anything right now because success/failure isn't saved

asmeurer marked this pull request as ready for review November 16, 2023 17:20

Carreau force-pushed the doctest branch from 2f94b73 to 6215dc9 Compare November 20, 2023 14:03

asmeurer and others added 14 commits November 27, 2023 11:48

First pass at using the doctest module in get_example_data

a13f7e4

Fix matplotlib showing the plots instead of saving them in doctests

9cdb1ae

Move the PapyriDocTestRunner class to the module level

0fba48d

Handle filename and lineno not being accessible

24389a4

They aren't actually that important for doctests because they are only used for the reporting, which we are bypassing anyways.

Fix a spelling error

d946cc6

Fix generation of doctest plot figures

877026f

Remove unused variable

cbd7fc0

Use ELLIPSIS option flag

83be405

Doesn't do anything right now because success/failure isn't saved

add debug print

1a0caee

debug log

98304d2

cleanup

236c309

... to >>>

841ac12

debug and execute config

ef6780e

don't execute in pandas

507e37c

Merge branch 'doctest' of github.com:asmeurer/papyri into doctest

8f6c1da

Fix errors from running doctests on SciPy

a2da2d8

asmeurer commented Dec 1, 2023

View reviewed changes

papyri/gen.py Show resolved Hide resolved

Merge branch 'main' into doctest

6da0c62

Fix the --no-exec flag

9bf10fb

Carreau added 2 commits December 16, 2023 14:13

Excluse set_numeric_ops which seem to be the root of crash.

a1c8cdb

reforamt to please linters

db1dbb4

Carreau reviewed Dec 17, 2023

View reviewed changes

asmeurer added 5 commits January 4, 2024 14:13

Merge branch 'main' into doctest

7ce945e

Fix the doctest runner not maintaining the namespace across different…

4d114ff

… blocks

Use DocTestParser to parse doctest blocks

4100b5e

This is more robust and correct than doing code.split('\n\n').

Don't generate text blocks for empty strings

dea1888

Fix duplicate code entries in example section

708dac9

Carreau reviewed Jan 10, 2024

View reviewed changes

Fix extending figures.

96502c7

Carreau reviewed Jan 10, 2024

View reviewed changes

papyri/gen.py Outdated Show resolved Hide resolved

Carreau added 2 commits January 10, 2024 01:24

Apply suggestions from code review

eb498cb

Rename config.exec to execute_doctests.

4621cba

Carreau reviewed Jan 10, 2024

View reviewed changes

Carreau added 2 commits January 10, 2024 10:41

Rename config.exec to execute_doctests.

45a2fa3

rename exec in a few more places

81071b9

Carreau merged commit 1861202 into jupyter:main Jan 10, 2024
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use the doctest module in get_example_data #308

Use the doctest module in get_example_data #308

asmeurer commented Oct 20, 2023 •

edited

Loading

asmeurer commented Nov 11, 2023

Carreau commented Nov 16, 2023 •

edited

Loading

asmeurer commented Nov 16, 2023

Carreau commented Nov 17, 2023

Carreau commented Nov 17, 2023 •

edited

Loading

asmeurer commented Dec 1, 2023

asmeurer commented Dec 15, 2023

asmeurer commented Dec 15, 2023

Carreau commented Dec 16, 2023

Carreau Dec 17, 2023

asmeurer commented Jan 4, 2024 •

edited

Loading

Carreau Jan 10, 2024

Carreau Jan 10, 2024

Carreau commented Jan 10, 2024

Use the doctest module in get_example_data #308

Use the doctest module in get_example_data #308

Conversation

asmeurer commented Oct 20, 2023 • edited Loading

asmeurer commented Nov 11, 2023

Carreau commented Nov 16, 2023 • edited Loading

asmeurer commented Nov 16, 2023

Carreau commented Nov 17, 2023

Carreau commented Nov 17, 2023 • edited Loading

asmeurer commented Dec 1, 2023

asmeurer commented Dec 15, 2023

asmeurer commented Dec 15, 2023

Carreau commented Dec 16, 2023

Carreau Dec 17, 2023

Choose a reason for hiding this comment

asmeurer commented Jan 4, 2024 • edited Loading

Carreau Jan 10, 2024

Choose a reason for hiding this comment

Carreau Jan 10, 2024

Choose a reason for hiding this comment

Carreau commented Jan 10, 2024

asmeurer commented Oct 20, 2023 •

edited

Loading

Carreau commented Nov 16, 2023 •

edited

Loading

Carreau commented Nov 17, 2023 •

edited

Loading

asmeurer commented Jan 4, 2024 •

edited

Loading