Split out `numpy` and `numpy-tests`, and update to NumPy v2.3.1 #86

agriyakhetarpal · 2025-05-03T09:04:39Z

This PR splits out numpy into two packages using the Meson install tags feature, one that installs runtime,python-runtime,devel and another that installs tests. This is an easier way to unvendor the tests from NumPy's wheels. It is to be noted that the package is still built fully – it is just that the relevant files are not installed into the wheel.

I've also updated to NumPy version 2.3.1, which was recently released, and dropped the associated patch from numpy/numpy#28936 as it is no longer needed.

agriyakhetarpal · 2025-05-03T11:04:26Z

AttributeError: module 'numpy' has no attribute 'zeros' is rather a strange problem – does this suggest that NumPy's test unvendoring is broken?

agriyakhetarpal · 2025-05-03T11:09:55Z

It still takes [34/36] (thread 4) built numpy-tests in 1m 24s, but that's much less than the 3.5 minutes required to build NumPy alone. So either this is working well and we are good to go, or that it isn't working well and the measurement is inaccurate – as NumPy is being built with other packages, and thus takes time – it could have taken 1m 24s too if no other packages were being built.

ryanking13 · 2025-05-04T04:10:03Z

I checked the build log of numpy-tests and it looks like it still builds all the source codes in numpy (you can download it from the GHA artifacts). Is it intended behavior?

numpy-tests.log

agriyakhetarpal · 2025-05-04T08:27:27Z

Yes, this is intended behaviour. The install tags only affect what files are installed, i.e., copied into the final wheels. In this case, these are the test-specific extension modules and Python files, but all files are still built.

I checked the artifacts and it is working as expected: the numpy wheel is 3.1 MiB, and the numpy-tests wheel is 1.6 MiB, the total of which matches the regular wheel build size.

Now I have to figure out why it doesn't work… because if/when it does, we have previously seen that we can compile from a shared directory in around ~8 seconds, and we can take advantage of that.

agriyakhetarpal · 2025-05-04T08:36:23Z

Ah, okay, I got why it doesn't work by checking the logs. The build dir for numpy-tests is:

Build dir: /home/runner/work/pyodide-recipes/pyodide-recipes/packages/numpy-tests/build/numpy-tests-2.2.5/build

when it should be:

Build dir: /home/runner/work/pyodide-recipes/pyodide-recipes/packages/numpy/build/numpy-2.2.5/build

We need to make the path: field in

source:
  path: ../numpy/build/numpy-2.2.5/

work to do so. The documentation for source/path says that relative paths are already supported, so maybe it's just that I didn't add this correctly?

ryanking13 · 2025-05-04T11:42:41Z

work to do so. The documentation for source/path says that relative paths are already supported, so maybe it's just that I didn't add this correctly?

No, I think it is how pyodide-build works (for now). When one sets source/path, we copy it to the build directory first to avoid polluting the source directory (code pointer).

So what happens now is /numpy/build/numpy-2.2.5/ directory is copied to numpy-tests/build/numpy-test-2.2.5/, and build happens in that directory.

Now I have to figure out why it doesn't work… because if/when it does, we have previously seen that we can compile from a shared directory in around ~8 seconds, and we can take advantage of that.

One possibility I can think of is that the mtime of the files was modified when the file was copied, and because of that, meson thinks it needs to recompile all the files, which causes all the compilation to happen again. Or, if recompilation is the expected behavior, it could be that the path of the file has changed, causing a cache miss.

ryanking13 · 2025-05-04T12:54:57Z

I know that this is in contrast to #74

BTW, this is not in contrast to #74, my intention for #74 was to remove all internal test packages such as cpp-exception-test or fpcast-test, not unvendored tests for other packages.

agriyakhetarpal · 2025-05-04T15:10:42Z

we copy it to the build directory first to avoid polluting the source directory (code pointer).

I see, that makes sense. Considering that we've been advertising/using this for local testing and debugging only, would you be willing to change this behaviour? That is, we could modify source/path so that we don't copy anything to the /build/ directory, and rather assume that this field will be used to represent the location of an already extracted archive (which is why we don't need the checksum for verification either).

I think such a way would be easier to manage in comparison to comparing mtimes.

However, one issue will still exist which we haven't sorted out: numpy-tests is separately installable and shouldn't need to depend on NumPy at either build-time or runtime, but it does here because there's no other way to ensure that NumPy is already built. In our case, we don't have the flexibility to change this. One hacky way could be to build all recipes first, and build numpy-tests in a separate pyodide build-recipes command.

ryanking13 · 2025-05-05T05:07:05Z

Considering that we've been advertising/using this for local testing and debugging only, would you be willing to change this behaviour? That is, we could modify source/path so that we don't copy anything to the /build/ directory, and rather assume that this field will be used to represent the location of an already extracted archive (which is why we don't need the checksum for verification either).

Yeah, I think you can try to see if it works. I guess changing the code in _prepare_source to something like

        srcdir = self.source_metadata.path.resolve()
+        self.src_extract_dir = srcdir
-        if not srcdir.is_dir():
-            raise ValueError(f"path={srcdir} must point to a directory that exists")
-
-        def ignore(path: str, names: list[str]) -> list[str]:
-            ignored: list[str] = []
-
-            if fnmatch.fnmatch(path, "*/dist"):
-                # Do not copy dist/*.whl files from a dirty source tree;
-                # this can lead to "Exception: Unexpected number of wheels" later.
-                ignored.extend(name for name in names if name.endswith(".whl"))
-            return ignored
-
-        shutil.copytree(srcdir, self.src_extract_dir, ignore=ignore)

would do the job, but not 100% sure.

Reusing the src directory for the build directory will make cleaning up the build directory or rebuilding a little bit more complex, but I think it is not so big problem.

there's no other way to ensure that NumPy is already built.

I think the current approach (setting numpy as a host dependency) is sufficient, at least for now. It's a very special case for two recipes to share source code like this, and I don't want to add too much complex behavior to handle this.

This reverts commit 9ac87e0.

agriyakhetarpal · 2025-05-20T23:24:19Z

Sorry for getting back to this a little late, and thanks for this patch! Yes, it worked for the build case perfectly – barring one minor problem, where the numpy-tests wheel gets placed in two locations: packages/numpy-tests/dist/ (which is where we want it to be), and also in packages/numpy-tests/build/numpy-tests-2.2.5/dist/. I assume that the wheel is copied into the former directory from the latter directory. This is quite easily overridable using the --install-dir, which we are already setting.

However, I do have my reservations around if we need such a patch at all in this case (especially for such a special case). I've pushed a few commits, please feel free to take a look!

agriyakhetarpal · 2025-05-21T00:06:51Z

Okay, so apparently pyodide build-recipes "!numpy-tests" is skipping numpy as well here, when it should only skip the entire package name. This looks like a bug where we parse the package queries in the graph builder. I'll take a look at it.

I resolved this by using '*,!numpy-tests' instead, which seems to work well.

agriyakhetarpal · 2025-06-24T17:59:15Z

Okay, there's just one test to get through:

@pytest.mark.skip_pyproxy_check
def test_runpythonasync_numpy(selenium_standalone):
    selenium_standalone.run_async(
        """
        import numpy as np
        x = np.zeros(5)
        """
    )
    for i in range(5):
        assert selenium_standalone.run_js(
            f"return pyodide.globals.get('x').toJs()[{i}] == 0"
        )

which says

FAILED packages/numpy/test_numpy.py::test_runpythonasync_numpy[chrome] - pytest_pyodide.runner.JavascriptException: PythonError: Traceback (most recent call last):
  File "/lib/python313.zip/_pyodide/_base.py", line 597, in eval_code_async
    await CodeRunner(
    ...<9 lines>...
    .run_async(globals, locals)
  File "/lib/python313.zip/_pyodide/_base.py", line 411, in run_async
    coroutine = eval(self.code, globals, locals)
  File "<exec>", line 3, in <module>
AttributeError: module 'numpy' has no attribute 'zeros'

I'm not too sure why, but I can debug it with NumPy outside this build – probably broken from the test splits.

github-actions · 2025-06-24T18:34:47Z

Package Build Results

Total packages built: 30
Total build time: 0:05:22

Package Build Times (click to expand)

Package	Build Time
openssl	4m 16s
numpy	3m 48s
sqlite3	1m 39s
numpy-tests	1m 33s
liblzma	1m 9s
test	25s
regex	12s
ssl	12s
hashlib	11s
lzma	5s
pydecimal	4s
pydoc_data	4s
MarkupSafe	4s
atomicwrites	3s
packaging	2s
pytz	1s
exceptiongroup	1s
pytest	1s
Jinja2	1s
more-itertools	1s
micropip	1s
iniconfig	1s
attrs	1s
tblib	1s
pytest-asyncio	1s
pluggy	1s
py	1s
setuptools	1s
pyparsing	1s
six	0s

Longest build: openssl (4m 16s)
Packages built in more than 10 minutes: 0

ryanking13 · 2025-06-25T10:59:31Z

I'm not too sure why, but I can debug it with NumPy outside this build – probably broken from the test splits.

I saw this happening time to time. I think there is an unknown flakiness in our build system, but I really don't understand why.

ryanking13

Awesome! Thanks for your work @agriyakhetarpal, and also thanks for updating the numpy version.

After merging this, we should also update the numpy version in pyodide/pyodide to align the numpy version in xbuildenv (yes, it is a bit annoying but should be done until we implmenent pyodide/pyodide-build#43).

ryanking13 · 2025-06-25T11:00:45Z

packages/numpy/test_numpy.py

@@ -3,7 +3,7 @@


 def test_numpy(selenium):
-    selenium.load_package("numpy")
+    selenium.load_package(["numpy", "numpy-tests"])


I don't think we need to import numpy-tests in this file.

Would you like to add a separate test file under packages/numpy-tests that actually runs some tests that are included in numpy-tests instead?

Makes sense, thanks! I added the full test suite in ae85b6d, and I will reduce it in subsequent commits before we merge this, so that CI time is not impacted.

I think a subset of the tests that would be fine for us would be those from numpy.linalg, numpy.fft, numpy.polynomial, numpy.random, and numpy.lib. We can leave out numpy.f2py (won't work anyway), numpy.strings, numpy.char, and numpy.ma. This is, unless you may have any other ideas here.

Yes, numpy is very important so I am okay with having more tests, but running all test suite will be too time consuming, so it would be nice if we could find a good Goldilocks point.

agriyakhetarpal · 2025-06-25T17:54:26Z

I tried to run the tests locally after splitting them out, but I don't think this is the right approach for testing them. For example, https://docs.scipy.org/doc/scipy/building/redistributable_binaries.html does not mention how to run the tests from the split wheels. I notice that the numpy-tests package is slightly broken, it says ERROR: module or package not found: numpy (missing __init__.py?).

One way to get around this is to install numpy-tests, copy the package tree entirely (which consists of just the tests), uninstall it, install numpy, copy the tests into the numpy wheels, and then run the tests from --pyargs numpy. In such a case, it is just better to build and test numpy without removing its tests at all.

I tried to follow the approach in pandas-dev/pandas#53007, but I don't think this is something we should be doing here – we should wait for developments on numpy/numpy#26289 first. It is only after numpy-tests becomes installable as a separate package alongside numpy that we will be able to test this properly.

So, would you be okay with proceeding without these tests, given that NumPy functionality is being tested out-of-tree to a reasonable extent, and also here in the numpy recipe, where the tests have been left unchanged in this PR? These would have been just additional tests, and no tests have been removed at this time as part of the split.

This reverts commit 609842d.

This reverts commit 1d6f2a8.

This reverts commit 000c23b.

This reverts commit ae85b6d.

ryanking13 · 2025-06-26T07:08:04Z

So, would you be okay with proceeding without these tests, given that NumPy functionality is being tested out-of-tree to a reasonable extent, and also here in the numpy recipe, where the tests have been left unchanged in this PR? These would have been just additional tests, and no tests have been removed at this time as part of the split.

Sure, that is okay with me.

agriyakhetarpal · 2025-06-26T10:13:04Z

Thanks! I'll open a follow-up issue to discuss more testing for NumPy and a follow-up PR to update NumPy on the Pyodide repository side.

….1 (#86)" This reverts commit d7901ed.

agriyakhetarpal added 7 commits May 3, 2025 14:29

Update to NumPy 2.2.5

8e5d616

Don't unvendor tests using our script

61c93e7

Don't build NumPy tests, use persistent build dir

9d22ab8

Add numpy-tests package

8741e42

Oops, duplicate key

5aec9e0

Swap build directories for source.path

56f76cc

Also import `"numpy-tests" with "numpy"

491cfc7

agriyakhetarpal added 8 commits May 21, 2025 02:28

Merge remote-tracking branch 'upstream/main' into numpy-tests

d37e2d7

Add the patch for numpy-tests, too

9ac87e0

Revert "Add the patch for numpy-tests, too"

f88a37a

This reverts commit 9ac87e0.

Add note to NumPy recipe

8f54f70

Add note to numpy-tests recipe

e3a920d

Don't require NumPy as a host dep for numpy-tests

246cb4b

Include numpy-tests for most packages' tests

ecfbb08

Build recipes in two commands

f713287

agriyakhetarpal changed the title ~~numpy and numpy-tests debugging (do not merge)~~ numpy and numpy-tests debugging (do not merge) [full build] May 20, 2025

agriyakhetarpal added 2 commits May 21, 2025 05:01

Oops, don't build numpy-tests in round one

cd4f92d

Fix GHA group name

22a6f1a

Fix parsing for packages to be ignored

e20851f

agriyakhetarpal added 2 commits June 24, 2025 21:10

Reset to original build command, instead of rounds

1fff045

Fix selenium.load_package() call args

2505dad

agriyakhetarpal changed the title ~~numpy and numpy-tests debugging (do not merge)~~ Split out numpy and numpy-tests Jun 24, 2025

Debug failing np.zeros test

865e034

agriyakhetarpal marked this pull request as ready for review June 24, 2025 18:46

agriyakhetarpal requested a review from ryanking13 June 24, 2025 18:46

agriyakhetarpal changed the title ~~Split out numpy and numpy-tests~~ Split out numpy and numpy-tests, and update to NumPy v2.3.1 Jun 24, 2025

agriyakhetarpal mentioned this pull request Jun 24, 2025

Build recipes from a shared directory [DRAFT] pyodide/pyodide-build#195

Closed

ryanking13 approved these changes Jun 25, 2025

View reviewed changes

agriyakhetarpal added 5 commits June 25, 2025 17:39

Don't import numpy-tests for NumPy's tests

89c9b6a

Run NumPy test suite

ae85b6d

Rename test file to have a unique name

000c23b

Oops, missed async def

1d6f2a8

Try increasing the timeout

609842d

agriyakhetarpal added 4 commits June 25, 2025 23:25

Revert "Try increasing the timeout"

213db3a

This reverts commit 609842d.

Revert "Oops, missed async def"

67139ad

This reverts commit 1d6f2a8.

Revert "Rename test file to have a unique name"

d48ecbe

This reverts commit 000c23b.

Revert "Run NumPy test suite"

92ce863

This reverts commit ae85b6d.

agriyakhetarpal requested a review from ryanking13 June 25, 2025 18:43

agriyakhetarpal merged commit d7901ed into pyodide:main Jun 26, 2025
4 checks passed

agriyakhetarpal deleted the numpy-tests branch June 26, 2025 10:14

This was referenced Jun 26, 2025

Better testing for NumPy #135

Open

Bump to NumPy 2.3.1, and split out numpy-tests (backport of pyodide/pyodide-recipes#86) pyodide/pyodide#5715

Draft

ryanking13 added a commit that referenced this pull request Jul 3, 2025

Revert "Split out numpy and numpy-tests, and update to NumPy v2.3…

47be1d4

….1 (#86)" This reverts commit d7901ed.

Split out numpy and numpy-tests, and update to NumPy v2.3.1 #86

Split out numpy and numpy-tests, and update to NumPy v2.3.1 #86

Uh oh!

Conversation

agriyakhetarpal commented May 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agriyakhetarpal commented May 3, 2025

Uh oh!

agriyakhetarpal commented May 3, 2025

Uh oh!

ryanking13 commented May 4, 2025

Uh oh!

agriyakhetarpal commented May 4, 2025

Uh oh!

agriyakhetarpal commented May 4, 2025

Uh oh!

ryanking13 commented May 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryanking13 commented May 4, 2025

Uh oh!

agriyakhetarpal commented May 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryanking13 commented May 5, 2025

Uh oh!

agriyakhetarpal commented May 20, 2025

Uh oh!

agriyakhetarpal commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agriyakhetarpal commented Jun 24, 2025

Uh oh!

github-actions bot commented Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Package Build Results

Uh oh!

ryanking13 commented Jun 25, 2025

Uh oh!

ryanking13 left a comment

Choose a reason for hiding this comment

Uh oh!

ryanking13 Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

agriyakhetarpal Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

agriyakhetarpal Jun 25, 2025

Choose a reason for hiding this comment

Uh oh!

ryanking13 Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

agriyakhetarpal commented Jun 25, 2025

Uh oh!

ryanking13 commented Jun 26, 2025

Uh oh!

agriyakhetarpal commented Jun 26, 2025

Uh oh!

Uh oh!

Uh oh!

Split out `numpy` and `numpy-tests`, and update to NumPy v2.3.1 #86

Split out `numpy` and `numpy-tests`, and update to NumPy v2.3.1 #86

agriyakhetarpal commented May 3, 2025 •

edited

Loading

ryanking13 commented May 4, 2025 •

edited

Loading

agriyakhetarpal commented May 4, 2025 •

edited

Loading

agriyakhetarpal commented May 21, 2025 •

edited

Loading

github-actions bot commented Jun 24, 2025 •

edited

Loading