Compilation Error. #3
Comments
Same problem. I am on WSL Ubuntu 18.04 with an NVIDIA 4090 and get the same error under the same conditions.
Also looking for ideas to fix it, thanks a lot.
I encountered the same issue. When comparing against the flingbot codebase (on which this codebase is based) I found that Manually copying the
@tlpss no major modifications were made, so copying them over should be fine. Let me know if you run into any related issues.
Can confirm that copying the files worked fine for building the simulator. I encountered a few other issues during the installation of the codebase, but got the task generation to work in the end. Haven't tried training/running the models.
I also encountered the same problem. My system is Ubuntu 18.04. Can you tell me where to manually copy the lib and external folders?
Have you solved this problem yet?
I think it should be from Flingbot https://github.com/columbia-ai-robotics/flingbot, but just to make sure, @sashank-tirumala could you point us to where you copied the files from?
You should take a look at the PyFlex folder from the flingbot codebase here. That worked for me to compile pyflex and make some modifications to the bindings.
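For anyone following along, the copy step discussed above could be sketched roughly like this. It is only a sketch under assumptions: the folder names `lib` and `external` come from this thread, the library path comes from the make error in the original report, and the two checkout locations are whatever paths you cloned to (the helper names here are hypothetical, not part of either codebase).

```python
import shutil
from pathlib import Path

# Relative path taken from the make error reported in this issue.
REQUIRED_LIB = Path("lib/linux64/NvFlexExtReleaseCUDA_x64.a")
# Folder names as mentioned in this thread (assumed to be all that is needed).
FOLDERS = ("lib", "external")


def missing_folders(pyflex_dir):
    """Return the names from FOLDERS that are absent in pyflex_dir."""
    root = Path(pyflex_dir)
    return [name for name in FOLDERS if not (root / name).is_dir()]


def has_required_lib(pyflex_dir):
    """Check whether the static library named in the make error exists."""
    return (Path(pyflex_dir) / REQUIRED_LIB).is_file()


def copy_missing(src_pyflex, dst_pyflex):
    """Copy folders that are missing in dst_pyflex from src_pyflex.

    Existing folders are left untouched. Returns the names that were copied.
    """
    src, dst = Path(src_pyflex), Path(dst_pyflex)
    copied = []
    for name in missing_folders(dst):
        if not (src / name).is_dir():
            raise FileNotFoundError(f"{src / name} not found in source checkout")
        shutil.copytree(src / name, dst / name)
        copied.append(name)
    return copied
```

Usage would be something like `copy_missing("flingbot/PyFlex", "cloth-funnels/PyFlex")` followed by `has_required_lib("cloth-funnels/PyFlex")` before re-running compile.sh; both paths are placeholders for your own checkouts.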
Thank you very much for your reply. I have successfully compiled according to your guidance, but there are still some errors reported during the evaluation and training. Have you run through the author's training and evaluation code?
Thank you for your guidance
@zcswdt I did encounter a few issues when I tried to run training, but did not look into them as I was mostly interested in the data generation part.
May I ask if you have successfully run the author's training code? When I ran the training commands provided by the author, I found that as the number of training steps increased, the program consumed more and more memory until all memory was used up, and then the training program was killed (Process finished with exit code 137 (interrupted by signal 9: SIGKILL)). I don't know what causes this and I'm stuck. Please help me, thank you!
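A note on reading that exit code: by the usual shell convention, a status above 128 means the process was terminated by signal number (status - 128), so 137 corresponds to signal 9. A small sketch of that arithmetic follows; `describe_exit_code` is a hypothetical helper name, and signal names are platform dependent. A SIGKILL after steadily growing memory is often the Linux out-of-memory killer, though nothing in this thread confirms that.

```python
import signal


def describe_exit_code(code):
    """Decode a shell-style exit status (128 + signal number convention)."""
    if code > 128:
        try:
            sig = signal.Signals(code - 128)
        except ValueError:
            # Not a known signal number on this platform.
            return f"exited with status {code}"
        return f"killed by signal {sig.value} ({sig.name})"
    return f"exited with status {code}"


# On Linux this prints: killed by signal 9 (SIGKILL)
print(describe_exit_code(137))
```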
@zcswdt I'm afraid I won't be able to help. As mentioned before, I have also had some issues with the training script, but have not looked into them properly as I was focused on the data generation.
Sorry @zcswdt, I don't have access to my original setup anymore. If you're having memory issues, have you been able to try using fewer processes / shrinking the network?
Thank you for your reply. Setting the parameter num_processes to 8 also uses up the memory. Today I will set it to 1 and see what happens. How do I shrink the network? I don't quite understand this.
Thank you very much for your reply. I have actually completed the training, but it consumes my memory.
If the memory issue is due to the neural network being trained simultaneously with the simulation, then you may be able to set the network to have fewer parameters, though I doubt it will change things by much.
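To narrow down whether memory really grows with the step count (and whether num_processes or the network size changes that), one option is to log the process's peak resident memory every so often. A minimal sketch, assuming Linux, where `ru_maxrss` is reported in kilobytes; `log_memory` and the step counter are hypothetical and not part of the training script, and this only measures the calling process, not simulation workers running in other processes.

```python
import resource


def peak_rss_mib():
    """Peak resident set size of this process in MiB (Linux: ru_maxrss is in KiB)."""
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss / 1024


def log_memory(step, every=100, log=print):
    """Log peak RSS every `every` steps; returns True if a line was logged."""
    if step % every == 0:
        log(f"step {step}: peak RSS {peak_rss_mib():.1f} MiB")
        return True
    return False
```

Calling `log_memory(step)` once per training iteration and comparing the logs between runs should show whether the growth is steady per step or tied to a particular phase.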
Thank you very much for your reply. Yesterday I tried running the evaluation code again and found that memory is consumed not only during training: when using the evaluation model you provided, evaluation also uses up my memory at a data_size of approximately 400, and the program is killed. During evaluation I also set num_processes to 1. I really don't know what to do. Your code is very important to me. Could you help me check it? My environment is built strictly according to the requirements in your README. If possible, you could also remotely access my computer to check the problem. Thank you very much. I really hope you can help; I have been working on this project for three months. Looking forward to your reply.
I am on an Ubuntu 22.04 system with an NVIDIA 3090 and have successfully compiled PyFlex multiple times in the past. When running compile.sh I get the following error:
This is inside the Docker container you guys provided. I think the relevant line to look at is:
make[2]: *** No rule to make target '/workspace/cloth-funnels/PyFlex/lib/linux64/NvFlexExtReleaseCUDA_x64.a', needed by 'pyflex.cpython-39-x86_64-linux-gnu.so'. Stop.
Seems like I am missing a makefile. Any idea how to fix this?
Thanks