Uploaded image for project: 'Apache MXNet (Retired)'
  1. Apache MXNet (Retired)
  2. MXNET-99

P3 (Nvidia V100) not supported by CUDA8

Add voteWatch issue
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Done
    • Major
    • Resolution: Unresolved
    • CI Build
    • None

    Description

      During my tests to upgrade our CI for [Quantization support|https://github.com/apache/incubator-mxnet/pull/9552,] I have noticed that NVCC supplied with CUDA 8 does not support the GPU architecture 7.0, which represents Volta GPUs used in AWS P3 instances. Our Makefile automatically removes this architecture in https://github.com/apache/incubator-mxnet/blob/master/Makefile#L280, which is fine since it would crash otherwise.

       

      This causes pre-compilation to generate incompatible PTX binaries and thus causing issues during runtime - the JIT compiler has to kick in and libraries like CuBLAS report CUBLAS_STATUS_ARCH_MISMATCH. 

       

      Attachments

        Issue Links

          Activity

            People

              marcoabreu Marco de Abreu
              marcoabreu Marco de Abreu

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 6.5h
                  6.5h

                  Slack

                    Issue deployment