-
Notifications
You must be signed in to change notification settings - Fork 779
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LGSVL crash after ~2hrs of training #1705
Comments
Which version of the simulator are you using? |
the version we are using is: the log is below: |
This may be a memory leak. Do you have any more info on the tests you are running? |
@revati-naik this might relate to the api reset issue |
@EricBoiseLGSVL - i am using only 1 agent without any NPCs or peds. regarding to the api commends, i am simply trying to train an agent to steer/throttle using RL algorithm. and i mainly use the another thing is that i upgraded to svlsimulator-linux64-2021.2.2 and still has the same issue can you elaborate on the memory leak issue? |
Ok, I see the error in this log. Looks like the connection is lost and throws a socket exception. @heeen Would this issue be resolved with the new web socket changes we have for 2021.3?
|
@EricBoiseLGSVL any news on this subject? |
We are on code freeze this week. :) We need rounds of testing and then docs. Should be in a few weeks |
Hi, I'm training an RL agent in the Highway101GLE environment, and there is a Segmentation fault after a few thousand episodes of training. I also suspect that the reason for this is a memory leak. I'm currently using 2021.3, and I also get this SocketException. I've attached my log file as well. My environment: Any help is appreciated and let me know if you need more information! |
it looks like the socket is closed when trying to send the test report.
Can you disable the test report feature and see if you still get the crash? |
It also seems you are loading the cyber bridge multiple times. Can you post your sensors used and any changes you have made to source? How are you resetting the simulation. Please step through how you are running simulations because memory leaks are the hardest to trouble shoot. It may even be an issue with the Unity engine. |
I still get the segmentation error when the test report feature is disabled. I'm actually not using the cyber bridge, but just using the Python API. But if loading the cyber bridge is causing the problem, how can we disable that? For sensors, I currently use one main RGB camera and 3 depth sensor cameras. In my training script, I'll reset the simulation by calling env.reset(), which sets up the ego and npc vehicles, and resets collision and done flags, either after a fixed number of steps have passed, or when the vehicle experiences a collision. |
if you are not using the bridge then yes, use a vehicle that has no bridge. This might be the issue. |
May I ask, how do we remove the bridge from the vehicle? |
hey,
i am using SVL SIM for training RL agent using LGSVL GYM library .
and after about 200 episodes, the simulator crushes. it happens several times, and i cant train the agent properly due to does limitations
My environment:
NVIDIA Corporation GP102 [TITAN X]
31.3 GiB RAM
Intel® Core™ i9-10900K CPU @ 3.70GHz × 20
Ubuntu 20.04
let me know if there is some log that i can provide
The text was updated successfully, but these errors were encountered: