Guys, I have some C++ elliptic curve code that does things like point addition/subtraction and other stuff like that. The workload I'm running involves generating some public keys and saving them on the hard disk, in batches so as to not overwhelm the memory, but my friend told me it's too slow on CPU.
I think the performance will improve massively if I can get the elliptic curve codes converted to CUDA or OpenCL or something like that, but I'm having trouble finding a suitable library that won't force a full rewrite (and
this is no help). Alternatively if there is already a 256-bit secp256k1 toolbox somewhere on the internet, that would be fine also.