Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TensorFlow-2.8.0-foss-2021b.eb issue with tf-estimator-nightly #20360

Open
VictorGoitea opened this issue Apr 14, 2024 · 1 comment
Open

TensorFlow-2.8.0-foss-2021b.eb issue with tf-estimator-nightly #20360

VictorGoitea opened this issue Apr 14, 2024 · 1 comment

Comments

@VictorGoitea
Copy link
Contributor

Hi, I need to install tensorflow 2.8.0 because it is a dependency for another program that needs this specific version. So, I was using the easyconfig for version 2.8.4 as a template but seems there are subtle change of dependencies, and I am struggling with
the dependency 'tf_estimator_nightly', version '2.8.0.dev2021122109'. It keeps failing and I have tried both "tf_estimator_nightly" and "tf-estimator-nightly". "-" seems to tigger a Syntax error and "_" triggers "ModuleNotFoundError: No module named 'tf_estimator_nightly'". I have also tried including a line of 'sanity_pip_check': False and did not work.

If you have any ideas how overcome this issue, I would be glad to hear it.

== FAILED: Installation ended unsuccessfully (build directory: /home/ucloud/.local/easybuild/build/TensorFlow/2.8.0/foss-2021b): build failed (firs
t 300 chars): Sanity check failed: extensions sanity check failed for 1 extensions: tf-estimator-nightly                                           
failing sanity check for 'tf-estimator-nightly' extension: command "python -c "import tf-estimator-nightly"" failed; output:                       
File "<string>", line 1                                                                                                                            
      import tf-estimator-nightly                                                                                                                    
             ^                                                                                                                                     
SyntaxError:  (took 2 hours 3 mins 12 secs)                                                                                                        
== Results of the build can be found in the log file(s) /tmp/eb-2jv5j576/easybuild-TensorFlow-2.8.0-20240414.102643.jVjRQ.log                      
ERROR: Build of /home/ucloud/.local/easybuild/easyconfigs/t/TensorFlow/TensorFlow-2.8.0-foss-2021b.eb failed (err: 'build failed (first 300 chars):
 Sanity check failed: extensions sanity check failed for 1 extensions: tf-estimator-nightly\nfailing sanity check for \'tf-estimator-nightly\' exte
nsion: command "python -c "import tf-estimator-nightly"" failed; output:\nFile "<string>", line 1\n    import tf-estimator-nightly\n             ^\
nSyntaxError: ') 

Here is the eb file I'am trying:

    # TensorFlow-2.8.0-foss-2021b.eb
    
    easyblock = 'PythonBundle'
    
    name = 'TensorFlow'
    version = '2.8.0'
    
    homepage = 'https://www.tensorflow.org/'
    description = "An open-source software library for Machine Intelligence"
    
    toolchain = {'name': 'foss', 'version': '2021b'}
    toolchainopts = {'pic': True}
    
    builddependencies = [
        ('Bazel', '4.2.2'),
        ('protobuf', '3.17.3'),
        # git 2.x required, see also https://github.com/tensorflow/tensorflow/issues/29053
        ('git', '2.33.1', '-nodocs'),
        ('pybind11', '2.7.1'),
        ('UnZip', '6.0'),
    ]
    dependencies = [
        ('Python', '3.9.6'),
        ('h5py', '3.6.0'),
        ('cURL', '7.78.0'),
        ('dill', '0.3.4'),
        ('double-conversion', '3.1.5'),
        ('flatbuffers', '2.0.0'),
        ('giflib', '5.2.1'),
        ('hwloc', '2.5.0'),
        ('ICU', '69.1'),
        ('JsonCpp', '1.9.4'),
        ('libjpeg-turbo', '2.0.6'),
        ('LMDB', '0.9.29'),
        ('NASM', '2.15.05'),
        ('nsync', '1.24.0'),
        ('SQLite', '3.36'),
        ('protobuf-python', '3.17.3'),
        ('flatbuffers-python', '2.0'),
        ('libpng', '1.6.37'),
        ('snappy', '1.1.9'),
        ('zlib', '1.2.11'),
    ]
    
    use_pip = True
    sanity_pip_check = True
    
    # Dependencies created and updated using findPythonDeps.sh:
    # https://gist.github.com/Flamefire/49426e502cd8983757bd01a08a10ae0d
    exts_list = [
        ('wrapt', '1.14.1', {
            'checksums': ['380a85cf89e0e69b7cfbe2ea9f765f004ff419f34194018a6827ac0e3edfed4d'],
        }),
        ('tf_estimator_nightly', '2.8.0.dev2021122109', {
            'source_tmpl': 'tf_estimator_nightly-%(version)s-py2.py3-none-any.whl',
            'checksums': ['0065a04e396b2890bd19761fc1de7559ceafeba12839f8db2c7e7473afaaf612'],
        }),
        ('termcolor', '1.1.0', {
            'checksums': ['1d6d69ce66211143803fbc56652b41d73b4a400a2891d7bf7a1cdf4c02de613b'],
        }),
        ('Werkzeug', '2.1.2', {
            'checksums': ['1ce08e8093ed67d638d63879fd1ba3735817f7a80de3674d293f5984f25fb6e6'],
        }),
        ('tensorboard_plugin_wit', '1.8.1', {
            'source_tmpl': SOURCE_PY3_WHL,
            'checksums': ['ff26bdd583d155aa951ee3b152b3d0cffae8005dc697f72b44a8e8c2a77a8cbe'],
        }),
        ('tensorboard-data-server', '0.6.1', {
            'source_tmpl': 'tensorboard_data_server-%(version)s-py3-none-any.whl',
            'checksums': ['809fe9887682d35c1f7d1f54f0f40f98bb1f771b14265b453ca051e2ce58fca7'],
        }),
        ('Markdown', '3.3.7', {
            'checksums': ['cbb516f16218e643d8e0a95b309f77eb118cb138d39a4f27851e6a63581db874'],
        }),
        ('grpcio', '1.46.1', {
            'modulename': 'grpc',
            'preinstallopts': "export GRPC_PYTHON_BUILD_EXT_COMPILER_JOBS=%(parallel)s && ",
            'checksums': ['4835b0f5fedbee3a3d6eea48f4e65dffd30b52c078690fa97ddc9fcea1e3b35d'],
        }),
        ('oauthlib', '3.2.0', {
            'checksums': ['23a8208d75b902797ea29fd31fa80a15ed9dc2c6c16fe73f5d346f83f6fa27a2'],
        }),
        ('requests-oauthlib', '1.3.1', {
            'checksums': ['75beac4a47881eeb94d5ea5d6ad31ef88856affe2332b9aafb52c6452ccf0d7a'],
        }),
        ('rsa', '4.8', {
            'checksums': ['5c6bd9dc7a543b7fe4304a631f8a8a3b674e2bbfc49c2ae96200cdbe55df6b17'],
        }),
        ('pyasn1-modules', '0.2.8', {
            'checksums': ['905f84c712230b2c592c19470d3ca8d552de726050d1d1716282a1f6146be65e'],
        }),
        ('cachetools', '5.1.0', {
            'checksums': ['8b3b8fa53f564762e5b221e9896798951e7f915513abf2ba072ce0f07f3f5a98'],
        }),
        ('google-auth', '2.6.6', {
            'modulename': 'google.auth',
            'checksums': ['1ba4938e032b73deb51e59c4656a00e0939cf0b1112575099f136babb4563312'],
        }),
        ('google-auth-oauthlib', '0.4.6', {
            'checksums': ['a90a072f6993f2c327067bf65270046384cda5a8ecb20b94ea9a687f1f233a7a'],
        }),
        ('tensorboard', '2.8.0', {
            'source_tmpl': SOURCE_PY3_WHL,
            'checksums': ['65a338e4424e9079f2604923bdbe301792adce2ace1be68da6b3ddf005170def'],
        }),
        ('opt-einsum', '3.3.0', {
            'source_tmpl': 'opt_einsum-%(version)s.tar.gz',
            'checksums': ['59f6475f77bbc37dcf7cd748519c0ec60722e91e63ca114e68821c0c54a46549'],
        }),
        ('Keras-Preprocessing', '1.1.2', {
            'source_tmpl': 'Keras_Preprocessing-%(version)s.tar.gz',
            'checksums': ['add82567c50c8bc648c14195bf544a5ce7c1f76761536956c3d2978970179ef3'],
        }),
        ('keras', '2.8.0', {
            'source_tmpl': SOURCE_WHL,
            'checksums': ['744d39dc6577dcd80ff4a4d41549e92b77d6a17e0edd58a431d30656e29bc94e'],
        }),
        ('google-pasta', '0.2.0', {
            'modulename': 'pasta',
            'checksums': ['c9f2c8dfc8f96d0d5808299920721be30c9eec37f2389f28904f454565c8a16e'],
        }),
        ('astunparse', '1.6.3', {
            'checksums': ['5ad93a8456f0d084c3456d059fd9a92cce667963232cbf763eac3bc5b7940872'],
        }),
        ('tblib', '1.7.0', {
            'checksums': ['059bd77306ea7b419d4f76016aef6d7027cc8a0785579b5aad198803435f882c'],
        }),
        ('portpicker', '1.5.2', {
            'checksums': ['c55683ad725f5c00a41bc7db0225223e8be024b1fa564d039ed3390e4fd48fb3'],
        }),
        ('absl-py', '1.0.0', {
            'modulename': 'absl',
            'checksums': ['ac511215c01ee9ae47b19716599e8ccfa746f2e18de72bdf641b79b22afa27ea'],
        }),
        # System dependency astor_archive
        ('astor', '0.8.1', {
            'checksums': ['6a6effda93f4e1ce9f618779b2dd1d9d84f1e32812c23a29b3fff6fd7f63fa5e'],
        }),
        # Optional profile plugin + dependency
        ('gviz-api', '1.10.0', {
            'source_tmpl': 'gviz_api-%(version)s.tar.gz',
            'checksums': ['846692dd8cc73224fc31b18e41589bd934e1cc05090c6576af4b4b26c2e71b90'],
        }),
        ('tensorboard-plugin-profile', '2.8.0', {
            'source_tmpl': 'tensorboard_plugin_profile-%(version)s.tar.gz',
            'checksums': ['58d65c979951e7628ca3b46c798e10e5813026a3ecfbacc319cf88ebe766134a'],
        }),
        (name, version, {
            'source_tmpl': 'v%(version)s.tar.gz',
            'source_urls': ['https://github.com/tensorflow/tensorflow/archive/'],
            'patches': [
                'TensorFlow-2.4.0_add-ldl.patch',
                'TensorFlow-2.4.0_dont-use-var-lock.patch',
                'TensorFlow-2.5.0_add-support-for-large-core-systems.patch',
                'TensorFlow-2.5.0_disable-avx512-extensions.patch',
                'TensorFlow-2.5.0-fix-alias-violation-in-absl.patch',
                'TensorFlow-2.5.0_fix-arm-vector-intrinsics.patch',
                'TensorFlow-2.5.0_fix-crash-on-shutdown.patch',
                'TensorFlow-2.7.1_fix_cpu_count.patch',
                'TensorFlow-2.7.1_remove-duplicate-gpu-tests.patch',
                'TensorFlow-2.8.4_exclude-xnnpack-on-ppc.patch',
                'TensorFlow-2.8.4_fix-PPC-JIT.patch',
                'TensorFlow-2.8.4_remove-libclang-and-io-gcs-deps.patch',
                'TensorFlow-2.8.4_resolve-gcc-symlinks.patch',
            ],
            'checksums': [
                {'v2.8.0.tar.gz': '66b953ae7fba61fd78969a2e24e350b26ec116cf2e6a7eb93d02c63939c6f9f7'},
                {'TensorFlow-2.4.0_add-ldl.patch': '917ee7282e782e48673596d8917c3207e60e0851bb9acf230a2a439b067af2e3'},
                {'TensorFlow-2.4.0_dont-use-var-lock.patch':
                 'b14f2493fd2edf79abd1c4f2dde6c98a3e7d5cb9c25ab9386df874d5f072d6b5'},
                {'TensorFlow-2.5.0_add-support-for-large-core-systems.patch':
                 '915f3477d6407fafd48269fe1e684a05ce361d9b9b85e58686682df87760f636'},
                {'TensorFlow-2.5.0_disable-avx512-extensions.patch':
                 '3655ce24c97569ac9738c07cac85347ba6f5c815ada95b19b606ffa46d4dda03'},
                {'TensorFlow-2.5.0-fix-alias-violation-in-absl.patch':
                 '12454fda3330fb45cd380377e283f04488b40e0b8ae7378e786ddf731a581f75'},
                {'TensorFlow-2.5.0_fix-arm-vector-intrinsics.patch':
                 '5edea55ce87d5adb14f6ed6996f308879e268b8cec760cf11288e3a56179a029'},
                {'TensorFlow-2.5.0_fix-crash-on-shutdown.patch':
                 '578c7493221ebd3dc25ca43d63a72cbb28fdf4112b1e2baa7390f25781bd78fd'},
                {'TensorFlow-2.7.1_fix_cpu_count.patch':
                 '5427a4cff0afc2fe5b24776ae9ca3616c56a79c1fde0025b37bec24837bb0698'},
                {'TensorFlow-2.7.1_remove-duplicate-gpu-tests.patch':
                 'f78526a34d85f4dda59b160b576aa3a3126db7073a58c4e3b1424923f6a21483'},
                {'TensorFlow-2.8.4_exclude-xnnpack-on-ppc.patch':
                 'ebd404ac56cc4ca662483f1f591e62e11749361be57a7ba5f4b2f0d03e829884'},
                {'TensorFlow-2.8.4_fix-PPC-JIT.patch':
                 '27d28293105b4dd0a25f58346c68b672f57215756f14a97c442d0e3317e93a2b'},
                {'TensorFlow-2.8.4_remove-libclang-and-io-gcs-deps.patch':
                 '10a6daad2ef65d649a6d4a6659f97f833359b3c35d3ce15e63d2db2e331a6a35'},
                {'TensorFlow-2.8.4_resolve-gcc-symlinks.patch':
                 '43ce9acc6bffff68a31d2263d0064d272999b2e0a2c6546690287cd1c9c90f04'},
            ],
            'test_script': 'TensorFlow-2.x_mnist-test.py',
            'test_tag_filters_cpu': '-gpu,-tpu,-no_cuda_on_cpu_tap,-no_pip,-no_oss,-oss_serial,-benchmark-test,-v1only',
            'test_tag_filters_gpu': ('gpu,-no_gpu,-nogpu,-gpu_cupti,-no_cuda11,-no_pip,-no_oss,-oss_serial,'
                                     '-benchmark-test,-v1only'),
            'test_targets': [
                '//tensorflow/core/...',
                '-//tensorflow/core:example_java_proto',
                '-//tensorflow/core/example:example_protos_closure',
                '//tensorflow/cc/...',
                '//tensorflow/c/...',
                '//tensorflow/python/...',
                # Fails on some nodes but C API isn't installed anyway
                '-//tensorflow/c/eager:c_api_test_gpu',
                '-//tensorflow/c/eager:c_api_distributed_test',
                '-//tensorflow/c/eager:c_api_distributed_test_gpu',
                # Race condition with port picker: https://github.com/tensorflow/tensorflow/issues/46602
                '-//tensorflow/c/eager:c_api_cluster_test_gpu',
                '-//tensorflow/c/eager:c_api_remote_function_test_gpu',
                '-//tensorflow/c/eager:c_api_remote_test_gpu',
                # Fails to open its own test.xml(?)
                '-//tensorflow/core/common_runtime:collective_param_resolver_local_test',
                # Fails on non-AVX-512 systems: https://github.com/tensorflow/tensorflow/issues/46532
                '-//tensorflow/core/common_runtime:mkl_layout_pass_test',
                '-//tensorflow/core/kernels/mkl:mkl_fused_ops_test',
                # Fails on AMD EPYC systems: https://github.com/tensorflow/tensorflow/issues/52151
                '-//tensorflow/core/kernels/mkl:mkl_fused_batch_norm_op_test',
                # All tests in this directory fail with segfault (TensorFlow Graph IR)
                '-//tensorflow/core/ir/importexport/tests/roundtrip/...',
                # Fails on e.g. AMD EPYC and known to be flaky on Mac: https://github.com/tensorflow/tensorflow/issues/56717
                '-//tensorflow/python/data/experimental/kernel_tests/service:fault_tolerance_test',
            ],
            'testopts': "--test_timeout=3600 --test_size_filters=small",
            'testopts_gpu': "--test_timeout=3600 --test_size_filters=small " +
                            "--run_under=//tensorflow/tools/ci_build/gpu_build:parallel_gpu_execute",
            'with_xla': True,
        }),
    ]
    
    moduleclass = 'lib'

@yqshao
Copy link

yqshao commented Apr 16, 2024

You probably need something like:

    ('tf_estimator_nightly', '2.8.0.dev2021122109', {
        'modulename': 'tensorflow_estimator',
        'source_tmpl': 'xxxx',
        'checksumes': ['xxxx']})

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants