Reenable test_nn tests for Windows #52051

Closed
wants to merge 1 commit into from

Conversation

janeyx99
Contributor

@janeyx99 janeyx99 commented Feb 10, 2021

Fixes #52002

@janeyx99 janeyx99 force-pushed the ci-all/reenable-windows-test_nn-tests branch from 0d03427 to 843c708 Compare February 11, 2021 00:57
@janeyx99 janeyx99 requested a review from ngimel February 11, 2021 20:58
Contributor

@facebook-github-bot facebook-github-bot left a comment

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@janeyx99 janeyx99 requested a review from a team February 11, 2021 21:36
Collaborator

@ngimel ngimel left a comment

Do you have to reenable embedding test somewhere?

@facebook-github-bot
Contributor

facebook-github-bot commented Feb 12, 2021

💊 CI failures summary and remediations

As of commit 9c7a739 (more details on the Dr. CI page):


  • 18/18 failures possibly* introduced in this PR
    • 1/18 non-CircleCI failure(s)

🕵️ 14 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build binary_windows_conda_3_6_cu102_nightly_test (1/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_6_cu112_nightly_test (2/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_wheel_3_9_cu102_nightly_build (3/14)

Step: "Build" (full log | diagnosis details | 🔁 rerun)

echo CUDA 10.2 installed failed.

C:\w\b>set "PATH=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin;C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\libnvvp;C:\Program Files (x86)\Windows Application Driver;C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\MSBuild\Current\Bin;C:\Program Files (x86)\Microsoft Visual Studio\Installer\;C:\tools\ruby26;C:\tools\ruby26\bin;C:\ProgramData\nvm;C:\tools\miniconda3;C:\tools\miniconda3\Library\mingw-w64\bin;C:\tools\miniconda3\Library\usr\bin;C:\tools\miniconda3\Library\bin;C:\tools\miniconda3\Scripts;C:\miniconda3\miniconda3\condabin;C:\Program Files (x86)\Microsoft SDKs\Azure\CLI2\wbin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Windows\System32\OpenSSH\;C:\ProgramData\GooGet;C:\Program Files\Google\Compute Engine\metadata_scripts;C:\Program Files (x86)\Google\Cloud SDK\google-cloud-sdk\bin;C:\Program Files\PowerShell\7\;C:\Program Files\Google\Compute Engine\sysprep;C:\Program Files\Docker;C:\ProgramData\chocolatey\bin;C:\Program Files\Git\cmd;C:\Program Files\Git\mingw64\bin;C:\Program Files\Git\usr\bin;C:\Program Files\Git LFS;C:\Program Files\Amazon\AWSCLI\bin\;C:\Program Files\Microsoft Service Fabric\bin\Fabric\Fabric.Code;C:\Program Files\Microsoft SDKs\Service Fabric\Tools\ServiceFabricLocalClusterManager;C:\Program Files (x86)\vim\vim80;C:\Go\bin;C:\Program Files\OpenJDK\jdk-12.0.2\bin;C:\ProgramData\nvm;C:\Program Files\nodejs;C:\Program Files\dotnet\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files (x86)\IncrediBuild;C:\Users\circleci\AppData\Local\Microsoft\WindowsApps" 

C:\w\b>set "CUDA_PATH=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2" 

C:\w\b>set "CUDA_PATH_V10_2=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2" 

C:\w\b>set "NVTOOLSEXT_PATH=C:\Program Files\NVIDIA Corporation\NvToolsExt" 

C:\w\b>if not exist "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2\bin\nvcc.exe" (
echo CUDA 10.2 installed failed.  
 exit /b 1 
) 

C:\w\b>echo Installing cuDNN... 
Installing cuDNN...

C:\w\b>7z x C:\w\b\windows\internal\\..\temp_build\cudnn-10.2-windows10-x64-v7.6.5.32.zip -o"C:\w\b\windows\internal\\..\temp_build\cudnn" 

7-Zip 19.00 (x64) : Copyright (c) 1999-2018 Igor Pavlov : 2019-02-21

See CircleCI build binary_windows_conda_3_9_cu102_nightly_test (4/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_8_cu101_nightly_test (5/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_7_cu112_nightly_test (6/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_9_cu112_nightly_test (7/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_6_cu101_nightly_test (8/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_8_cu102_nightly_test (9/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_7_cu102_nightly_test (10/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_9_cu101_nightly_test (11/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build binary_windows_conda_3_7_cu101_nightly_test (12/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1

See CircleCI build pytorch_windows_vs2019_py36_cuda10.1_build (13/14)

Step: "Install Cuda" (full log | diagnosis details | 🔁 rerun)

ls: cannot access '/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe': No such file or directory

Folders: 11
Files: 130
Size:       907512
Compressed: 111420
+ mkdir -p 'C:/Program Files/NVIDIA Corporation/NvToolsExt'
+ cp -r NvToolsExt/bin NvToolsExt/docs NvToolsExt/include NvToolsExt/lib NvToolsExt/samples 'C:/Program Files/NVIDIA Corporation/NvToolsExt/'
+ export 'NVTOOLSEXT_PATH=C:\Program Files\NVIDIA Corporation\NvToolsExt\'
+ NVTOOLSEXT_PATH='C:\Program Files\NVIDIA Corporation\NvToolsExt\'
+ ls '/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe'
ls: cannot access '/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe': No such file or directory
+ echo 'CUDA installation failed'
CUDA installation failed
+ mkdir -p /c/w/build-results
+ 7z a 'c:\w\build-results\cuda_install_logs.7z' cuda_install_logs

7-Zip 19.00 (x64) : Copyright (c) 1999-2018 Igor Pavlov : 2019-02-21

Scanning the drive:
1 folder, 2 files, 3721951 bytes (3635 KiB)

See CircleCI build binary_windows_conda_3_8_cu112_nightly_test (14/14)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

AssertionError: Torch not compiled with CUDA enabled
) 

(testenv) C:\w\b\windows>echo Checking that CUDA archs are setup correctly 
Checking that CUDA archs are setup correctly

(testenv) C:\w\b\windows>python -c "import torch; torch.randn([3,5]).cuda()" 
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "C:\w\b\windows\conda\envs\testenv\lib\site-packages\torch\cuda\__init__.py", line 164, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

(testenv) C:\w\b\windows>if ERRORLEVEL 1 exit /b 1 


Exited with code exit status 1


3 failures not recognized by patterns:

Job Step Action
CircleCI binary_windows_wheel_3_9_cpu_nightly_build Build 🔁 rerun
CircleCI binary_windows_wheel_3_9_cu101_nightly_build Build 🔁 rerun
CircleCI binary_windows_wheel_3_9_cu112_nightly_build Build 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details). Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

@janeyx99
Contributor Author

> Do you have to reenable embedding test somewhere?

Ah yes, sorry, this was sitting on top of another PR. The changes are here: 843c708

Contributor

@facebook-github-bot facebook-github-bot left a comment

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@janeyx99 janeyx99 force-pushed the ci-all/reenable-windows-test_nn-tests branch from 843c708 to 9c7a739 Compare February 12, 2021 22:10
Contributor

@facebook-github-bot facebook-github-bot left a comment

@janeyx99 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@@ -447,7 +447,7 @@ Tensor _embedding_bag_per_sample_weights_backward_cuda(
   AT_ASSERT(weight.dim() == 2);
   AT_ASSERT(weight.size(1) == embedding_features);

-  const int threads_per_block = 1024;
+  const int threads_per_block = 512;
Contributor

Do we plan to do something like:

#ifdef _MSC_VER
const int threads_per_block = 512;
#else
const int threads_per_block = 1024;
#endif
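
For illustration only, here is a minimal host-side sketch of how the guarded constant suggested above could feed the launch geometry. It is not the code that was merged: `kThreadsPerBlock`, `blocks_for`, and `fits_register_budget` are hypothetical names, and the 65536-register per-block budget and the per-thread register counts in the usage note are assumptions (the actual limit depends on the GPU architecture, and the real register usage depends on the compiler). One common cause of "too many resources requested for launch" is that threads per block times registers per thread exceeds the per-block register limit, which is why halving the block size can help.

```cpp
#include <cstdint>

// Compile-time block size: smaller when building with MSVC, larger elsewhere,
// mirroring the #ifdef suggestion in the review comment.
#ifdef _MSC_VER
constexpr int kThreadsPerBlock = 512;
#else
constexpr int kThreadsPerBlock = 1024;
#endif

// Hypothetical helper: number of blocks needed to cover `n` work items
// (ceiling division, assuming n >= 0 and threads_per_block > 0).
constexpr std::int64_t blocks_for(std::int64_t n, int threads_per_block) {
  return (n + threads_per_block - 1) / threads_per_block;
}

// Hypothetical check: does one block fit in an ASSUMED per-block register
// budget? The default of 65536 is an assumption, not a value from this PR.
constexpr bool fits_register_budget(int threads_per_block,
                                    int regs_per_thread,
                                    int regs_per_block_limit = 65536) {
  return static_cast<std::int64_t>(threads_per_block) * regs_per_thread
         <= regs_per_block_limit;
}

// Either branch of the guard yields a block size that is a multiple of the
// 32-thread warp size.
static_assert(kThreadsPerBlock % 32 == 0, "block size should be warp-aligned");
```

Under these assumptions, a kernel compiled to 80 registers per thread (a made-up figure) would not fit with 1024 threads per block but would fit with 512, and `blocks_for(n, kThreadsPerBlock)` gives the grid size either way. A guard like this keeps the larger block size on platforms where the launch already succeeds.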

@facebook-github-bot
Contributor

@janeyx99 merged this pull request in 68e2a8c.

xsacha pushed a commit to xsacha/pytorch that referenced this pull request Mar 31, 2021
Summary:
Fixes pytorch#52002

Pull Request resolved: pytorch#52051

Reviewed By: ngimel

Differential Revision: D26409749

Pulled By: janeyx99

fbshipit-source-id: 5fa76d4fff8cf0fe2130c925fde9dffd0d1e7172
@github-actions github-actions bot deleted the ci-all/reenable-windows-test_nn-tests branch February 10, 2024 01:56
Development

Successfully merging this pull request may close these issues.

CUDA error: too many resources requested for launch in Windows CI for CUDA 11.2
4 participants