Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Floating point exception #1

Open
drewhemm opened this issue May 9, 2021 · 2 comments
Open

Floating point exception #1

drewhemm opened this issue May 9, 2021 · 2 comments

Comments

@drewhemm
Copy link

drewhemm commented May 9, 2021

Hi,

I have compiled this and attempted to run it on a server, but I get the following:

# ./hdrdmacp -s
Looking for IB devices ...

=============================================
Found 1 devices
---------------------------------------------
   device 0 : mlx4_0 : uverbs0 : IB : InfiniBand channel adapter : Num. ports=2 : port num=1 : lid=1
=============================================

Device mlx4_0 opened. num_comp_vectors=96
Port attributes:
           state: 4
         max_mtu: 5
      active_mtu: 5
  port_cap_flags: 38865000
      max_msg_sz: 1073741824
    active_width: 2
    active_speed: 4
      phys_state: 5
      link_layer: 1
Floating point exception

Any ideas what could be wrong? I have searched in the source code and float is not explicitly used. I'll continue to have a look to see if I can spot where the problem is.

@drewhemm
Copy link
Author

drewhemm commented May 9, 2021

Went through the code line-by-line and found the problem here:

https://github.com/JeffersonLab/hdrdmacp/blob/master/hdRDMA.cc#L143

buff_section_len = (buff_len_GB*1000000000)/(uint64_t)num_buff_sections;

Basically, the -n and -m flags are required for the server, not optional:

$ hdrdmacp -s -n 4 -m 8GB
Looking for IB devices ...

=============================================
Found 1 devices
---------------------------------------------
   device 0 : mlx4_0 : uverbs0 : IB : InfiniBand channel adapter : Num. ports=2 : port num=1 : lid=2
=============================================

Device mlx4_0 opened. num_comp_vectors=32
Port attributes:
           state: 4
         max_mtu: 5
      active_mtu: 5
  port_cap_flags: 38865002
      max_msg_sz: 1073741824
    active_width: 2
    active_speed: 4
      phys_state: 5
      link_layer: 1
Created 4 buffers of 2000MB (8GB total)
Listening for connections on port ... 10470

@GalaxyBangBang
Copy link

Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants