"Status 500: Unexpected Problem" when accessing grid's forecast #241
Replies: 21 comments 29 replies
-
I've noticed this, too. I added retry looping with exponential backoff to my code. Usually the second or third GET will succeed.
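For anyone who wants to try the same approach, here is a minimal sketch of a retry loop with exponential backoff, using only the Python standard library. It is not the commenter's actual code: the names `fetch`, `get_with_backoff`, `max_attempts`, and `base_delay` are made up for illustration, and the default values are assumptions you should tune.

```python
import time
import urllib.error
import urllib.request
from typing import Callable, Tuple


def fetch(url: str) -> Tuple[int, bytes]:
    """Perform one GET and return (status, body) without raising on HTTP errors."""
    try:
        with urllib.request.urlopen(url, timeout=10) as response:
            return response.status, response.read()
    except urllib.error.HTTPError as err:
        # A 500 arrives here; keep the body because it may carry a correlation ID.
        return err.code, err.read()


def get_with_backoff(
    url: str,
    fetch_fn: Callable[[str], Tuple[int, bytes]] = fetch,
    max_attempts: int = 4,      # assumed value, not taken from the thread
    base_delay: float = 1.0,    # assumed value, in seconds
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, bytes]:
    """Retry on 5xx responses, doubling the wait after each failed attempt."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = fetch_fn(url)
        if status < 500:
            return status, body
        if attempt < max_attempts - 1:
            # Waits base_delay, then 2x, then 4x, and so on.
            sleep(base_delay * 2 ** attempt)
    # Every attempt failed; hand back the last response so the caller can log it.
    return status, body
```

Calling `get_with_backoff("https://api.weather.gov/gridpoints/MTR/83,153/forecast/hourly")` would make up to four attempts with waits of 1, 2, and 4 seconds between them. The `fetch_fn` and `sleep` parameters exist only so the loop can be exercised without touching the network.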
-
Interesting: revisiting this problem 12 days later, the same grid is still producing the error.
-
I am also getting this problem, even with 3 retries 1s apart. I am going to try backing off my retries, but I doubt that will help. I will say that I am hitting a different endpoint, though: https://api.weather.gov/gridpoints/MTR/83,153/forecast/hourly
-
Status 500 errors are operational outages and can be reported to the helpdesk for a more immediate response. The original request has been purged from the logs, so I can no longer determine the cause. If anyone can provide a more recent correlation ID, I will look into it.
-
Unfortunately, my logging doesn't have a correlationId. I make my requests at 12am PST. Are there planned outages at certain times? I looked here and didn't see anything.
-
Here is a recently generated correlation ID for the issue: 4a3c06e7. The link in the original post for this issue actually always generates this error for me, which is interesting given that the grid covers a fairly large city in California.
-
Example of a 500 status error (Unexpected Problem) that is not intermittent: https://api.weather.gov/gridpoints/MTR/106,127/forecast/hourly The correlation ID is the first line in the 500 error response.
-
A quick glance at the correlation IDs indicates an issue with the service that translates lat/lon to grids. Support is typically able to address this issue. As always, please send all 500 errors to NCO support.
-
Still getting 500s frequently. The interesting point is that when I access https://api.weather.gov/gridpoints/ILM/94,68/forecast directly, it works, but when I request it from my application, it returns 500.
-
At what threshold should we be reporting /gridpoints issues to NCO, given that it's a known long-term issue? These are the counts of 500 errors for a call to
-
I hate to say it, but that's the exact question to ask NCO. It has been too long. Priorities get shifted (at times even away from the API completely), but the team is finally growing and hopefully can tackle some key issues soon.
-
All day I've been getting 500 error codes at https://api.weather.gov/gridpoints/DTX/42,58/forecast which is quite annoying, as this breaks several Home Assistant automations.
-
I'm also getting the 500 error codes: correlationId: '298bf3d6', title: 'Unexpected Problem', type: 'https://api.weather.gov/problems/UnexpectedProblem', status: 500, detail: 'An unexpected problem has occurred.'
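Since support asks for a correlation ID with every report, it may help to log it automatically. The sketch below assumes the 500 response body is JSON with the fields shown in the comment above (`correlationId`, `title`, `status`, `detail`); the function name `extract_correlation_id` is made up for illustration.

```python
import json
from typing import Optional, Union


def extract_correlation_id(body: Union[str, bytes]) -> Optional[str]:
    """Return the correlationId from an error body, or None if it is absent."""
    try:
        problem = json.loads(body)
    except (ValueError, TypeError):
        # Not JSON (or not decodable text), so there is nothing to extract.
        return None
    if not isinstance(problem, dict):
        return None
    value = problem.get("correlationId")
    return value if isinstance(value, str) else None


# Example body, reconstructed from the fields quoted in the comment above.
sample = json.dumps({
    "correlationId": "298bf3d6",
    "title": "Unexpected Problem",
    "type": "https://api.weather.gov/problems/UnexpectedProblem",
    "status": 500,
    "detail": "An unexpected problem has occurred.",
})
print(extract_correlation_id(sample))
```

Writing the returned ID to your application log next to the request URL and timestamp gives you something concrete to include in a report to NCO support.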
-
Seems the KC outage problem is bad enough now that I can't even check the status at https://www.nco.ncep.noaa.gov/status/messages/ (it times out).
-
Good day. I just found this today. I am also having the same issues. What's interesting, though, is that I have better luck with the same forecast locations if I try very early in the morning. As it gets later in the day (MT), it seems to get worse and worse. I'm about to try the repeat-query idea, but I'm writing an app that will run on both desktop and mobile devices, so I'm worried about performance if I have to do retries. It makes me think the server processing these requests can't handle the number of requests being thrown at it.
-
https://api.weather.gov/gridpoints/DTX/60,45/forecast does not work either.
-
I’ve had pretty good success with a 2-second delay on failure and a maximum of 3 retries.
-
I submitted this as an operational issue today and thought I would share here in case it is helpful. The requests below were working earlier today but they have stopped working. There are other grid points that I am able to successfully access.
Any insights that can be provided are appreciated.
-
I have been experiencing this as well. Requesting two reports in quick succession with a script has been failing pretty consistently, even with exponential backoff. I'm not 100% sure this is the reason, but I believe that not setting the headers tends to result in much more aggressive rate limiting (see the Authentication spec). After adding the headers, I very rarely, if ever, get a 500 or even need the backoff. If this is truly the reason for the 500s, it would be much more user-friendly to return a 401 Unauthorized or a 429 rate-limiting response that asks for a user agent.
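For reference, a minimal sketch of setting a User-Agent header with the Python standard library. The helper name `build_request`, the application name, and the contact address are placeholders, and the "(app, contact)" layout is only one plausible format. Whether the header actually reduces 500s is this commenter's observation, not something confirmed in the thread.

```python
import urllib.request


def build_request(url: str, app_name: str, contact: str) -> urllib.request.Request:
    """Build a GET request that identifies the calling application."""
    # Placeholder format: substitute your real application name and contact.
    user_agent = f"({app_name}, {contact})"
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


request = build_request(
    "https://api.weather.gov/gridpoints/ILM/94,68/forecast",
    app_name="example-app",          # hypothetical application name
    contact="ops@example.com",       # hypothetical contact address
)
# To send it: urllib.request.urlopen(request, timeout=10)
```

Building the request separately from sending it makes it easy to reuse the same identifying header for every call your script makes.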
-
There have been a couple of particularly bad days with 500 errors in the last week or so, enough that it generated traffic on the NCO-Ops master ticket. These are the error counts for my application for the last week (times as of midnight PDT):
The 500s have never been due to rate limiting (it is a resource limitation on NWS's end), but caching is a factor, and the User-Agent is used to determine whether a request is from the same client. It was mentioned in the NCO ticket that there was a facility failover event, and the errors spiked just after that event. My usage is either a frequent poll of /alerts/active for a couple of zones or an hourly poll of forecast data for one zone. My poller retries after about 5 seconds, which is usually enough for whatever backend issue caused the first 500 to clear. My average is about 6-7 500 errors per day, counting the retry, since Sept. 1, 2023.
-
Describe the bug
Occasionally when accessing a grid's weather forecast, a status 500 error is returned. The error message indicates that an unexpected problem has occurred.
To Reproduce
The link currently providing this error is: https://api.weather.gov/gridpoints/MTR/114,128
If possible, also include the correlationId from the response
1dc193d9
Expected behavior
The expected JSON response for a grid's forecast.
Additional Context
I have noticed that this is a rather intermittent issue.
Thanks,
Chase