-------------------------------------------------------------------------- By default, for Open MPI 4.0 and later, infiniband ports on a device are not used by default. The intent is to use UCX for these devices. You can override this policy by setting the btl_openib_allow_ib MCA parameter to true. Local host: acn72 Local adapter: mlx5_0 Local port: 1 -------------------------------------------------------------------------- -------------------------------------------------------------------------- WARNING: There was an error initializing an OpenFabrics device. Local host: acn72 Local device: mlx5_0 -------------------------------------------------------------------------- corrupted size vs. prev_size while consolidating [acn72:2225516] *** Process received signal *** [acn72:2225516] Signal: Aborted (6) [acn72:2225516] Signal code: (-6) [acn72:2225516] [ 0] /lib/x86_64-linux-gnu/libc.so.6(+0x43090)[0x7fa589b4a090] [acn72:2225516] [ 1] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0xcb)[0x7fa589b4a00b] [acn72:2225516] [ 2] /lib/x86_64-linux-gnu/libc.so.6(abort+0x12b)[0x7fa589b29859] [acn72:2225516] [ 3] /lib/x86_64-linux-gnu/libc.so.6(+0x8d26e)[0x7fa589b9426e] [acn72:2225516] [ 4] /lib/x86_64-linux-gnu/libc.so.6(+0x952fc)[0x7fa589b9c2fc] [acn72:2225516] [ 5] /lib/x86_64-linux-gnu/libc.so.6(+0x9704e)[0x7fa589b9e04e] [acn72:2225516] [ 6] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN6ReaxFF16Read_Force_FieldEPKcPNS_16reax_interactionEPNS_14control_paramsEP19ompi_communicator_t+0xa21)[0x7fa54d965371] [acn72:2225516] [ 7] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS10PairReaxFF5coeffEiPPc+0x7c)[0x7fa54d95ed6c] [acn72:2225516] [ 8] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input10pair_coeffEv+0x32d)[0x7fa54d40f7ad] [acn72:2225516] [ 9] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input15execute_commandEv+0x7f1)[0x7fa54d414551] [acn72:2225516] [10] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input3oneERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0xa0)[0x7fa54d415460] [acn72:2225516] [11] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS15KimInteractions8do_setupEiPPc+0xf99)[0x7fa54d69d7a9] [acn72:2225516] [12] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS15KimInteractions7commandEiPPc+0x55)[0x7fa54d69e365] [acn72:2225516] [13] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS10KimCommand7commandEiPPc+0x406)[0x7fa54d695646] [acn72:2225516] [14] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input15execute_commandEv+0xc0f)[0x7fa54d41496f] [acn72:2225516] [15] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input3oneERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0xa0)[0x7fa54d415460] [acn72:2225516] [16] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS10Deprecated7commandEiPPc+0x52f)[0x7fa54d25ee9f] [acn72:2225516] [17] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input15execute_commandEv+0xc0f)[0x7fa54d41496f] [acn72:2225516] [18] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(_ZN9LAMMPS_NS5Input3oneERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE+0xa0)[0x7fa54d415460] [acn72:2225516] [19] /usr/local/lib/python3.8/dist-packages/lammps/liblammps.so(lammps_command+0x92)[0x7fa54d453892] [acn72:2225516] [20] /lib/x86_64-linux-gnu/libffi.so.7(+0x6ff5)[0x7fa55469cff5] [acn72:2225516] [21] /lib/x86_64-linux-gnu/libffi.so.7(+0x640a)[0x7fa55469c40a] [acn72:2225516] [22] /usr/lib/python3.8/lib-dynload/_ctypes.cpython-38-x86_64-linux-gnu.so(_ctypes_callproc+0x5b6)[0x7fa5546bb306] [acn72:2225516] [23] /usr/lib/python3.8/lib-dynload/_ctypes.cpython-38-x86_64-linux-gnu.so(+0x13ae7)[0x7fa5546bbae7] [acn72:2225516] [24] python3(_PyObject_MakeTpCall+0x296)[0x5d6066] [acn72:2225516] [25] python3(_PyEval_EvalFrameDefault+0x6329)[0x54ce69] [acn72:2225516] [26] python3(_PyFunction_Vectorcall+0x1b6)[0x5d5846] [acn72:2225516] [27] python3(_PyEval_EvalFrameDefault+0x907)[0x547447] [acn72:2225516] [28] python3(_PyEval_EvalCodeWithName+0x26a)[0x54552a] [acn72:2225516] [29] python3[0x4e1bd0] [acn72:2225516] *** End of error message *** Aborted Command exited with non-zero status 134 {"realtime":4.06,"usertime":1.89,"systime":1.51,"memmax":139156,"memavg":0}