nearbyint fails with NVRTC compile error for integer inputs #2524

Open
IvanYashchuk opened this issue Feb 27, 2023 · 0 comments
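🐛 Describe the bug

Calling fd.ops.round on an Int32 tensor fails at kernel compilation: the generated code calls nearbyint with an int argument, which NVRTC rejects as ambiguous between the float and double overloads.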

from nvfuser import FusionDefinition, DataType
import torch

with FusionDefinition() as fd:
    t1 = fd.define_tensor(symbolic_sizes=[-1], contiguous=[True], dtype=DataType.Int32)
    t2 = fd.ops.round(t1)
    fd.add_output(t2)

a = torch.ones(2, device="cuda", dtype=torch.int32)
fd.execute((a,))

CUDA NVRTC compile error: __tmp_kernel1.cu(9175): error: more than one instance of overloaded function "nearbyint" matches the argument list:
            function "nearbyint(double)"
__nv_nvrtc_builtin_header.h(156230): here
            function "nearbyint(float)"
__nv_nvrtc_builtin_header.h(157258): here
            argument types are: (int)

1 error detected in the compilation of "__tmp_kernel1.cu".
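
The ambiguity in the error above can be reproduced in plain host C++, which also makes it easy to sketch two possible ways around it. This is only an illustration under stated assumptions: the device_like namespace, round_any, and round_int_via_cast are hypothetical names and are not part of nvFuser, and the two overloads simply mirror the nearbyint(float) and nearbyint(double) candidates listed by NVRTC.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for the device math header: only float and double
// overloads exist, matching the two candidates named in the NVRTC error.
namespace device_like {
inline float nearbyint(float x) { return std::nearbyint(x); }
inline double nearbyint(double x) { return std::nearbyint(x); }
}  // namespace device_like

// device_like::nearbyint(1) would not compile: int -> float and int -> double
// are both conversions of equal rank, so neither overload is a better match.

// Option 1 (assumption, not nvFuser code): rounding an integer is the
// identity, so integral inputs can bypass nearbyint entirely.
inline int round_any(int x) { return x; }
inline float round_any(float x) { return device_like::nearbyint(x); }
inline double round_any(double x) { return device_like::nearbyint(x); }

// Option 2 (assumption): cast explicitly so exactly one overload matches,
// then convert the result back to the integer type.
inline int round_int_via_cast(int x) {
  return static_cast<int>(device_like::nearbyint(static_cast<double>(x)));
}
```

With these definitions, round_any(3) and round_int_via_cast(3) both return 3, while round_any(2.5) forwards to the double overload. Whether a real fix should skip the op for integer dtypes or insert a cast is a design decision for the maintainers; the sketch only shows that either choice removes the ambiguous call.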

Versions

devel
