2nd December 2020, 15:24 | #1 |
[M] Reviewer Join Date: May 2010 Location: Romania
Posts: 153,514
Amazon was brought down by new servers

Too many threads brought down the cloud

Amazon Web Services (AWS) has revealed the actual cause of the massive outage that impacted thousands of online sites and services, including Amazon's own services, last week.

According to the company, the outage was not driven by any memory problem in the network. Rather, it was triggered by the addition of new servers to the Amazon Kinesis real-time data processing service. Adding new capacity caused all servers in the Kinesis system to exceed the maximum number of 'threads' allowed by an operating system (OS) configuration. Each server in the Kinesis front-end fleet needs to create threads to communicate with every other server in the fleet, and when they couldn’t, the whole lot went tits up.

This resulted in a series of other problems that eventually took down thousands of websites and services, including those from some big companies such as Adobe, Flickr, Roku, Twilio and Autodesk.

AWS's own services were also affected, including ACM, Amplify Console, AppStream2, AppSync, Athena, Batch, CodeArtifact, CodeGuru Profiler, CodeGuru Reviewer, CloudFormation, CloudMap, CloudTrail, Connect, Comprehend, DynamoDB, Elastic Beanstalk, EventBridge, GuardDuty, IoT Services, Lambda, Lex, Macie, Managed Blockchain, Marketplace, MediaLive, MediaConvert, Personalize, RDS Performance Insights, Rekognition, SageMaker and Workspaces.

The multi-hour outage affected the US-East-1 region, according to the company. Apparently it was all fixed by turning it off and turning it on again. Unfortunately, since that meant restarting the entire Kinesis service, it took a while.

Amazon has apologised for the outage and said it would apply the lessons learned to further improve the reliability of its services. In the short term, the company plans to move to servers with more powerful CPUs and more memory, which will let it reduce the number of servers and therefore the thread count across the fleet. It is also carrying out tests to increase the thread count limits in the OS configuration. AWS believes the measure will give additional safety margin by allowing more threads per server. The company also plans to introduce lots of other changes to "radically improve the cold-start time for the front-end fleet".

https://fudzilla.com/news/memory-and...by-new-servers
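For anyone wondering why adding servers tipped every machine over at the same moment, here is a rough Python sketch of the arithmetic. It assumes the simplest possible model: each front-end server holds one thread per peer plus some fixed baseline. The function names, the baseline figure and the limit used in the usage note are made up for illustration and are not AWS's real numbers.

```python
def threads_per_server(fleet_size: int, baseline_threads: int = 0) -> int:
    """Threads one server needs: one per peer, plus a fixed baseline.

    Assumed model only; real per-server thread usage is not given in the article.
    """
    if fleet_size < 1:
        raise ValueError("fleet_size must be at least 1")
    return baseline_threads + (fleet_size - 1)


def exceeds_limit(fleet_size: int, thread_limit: int, baseline_threads: int = 0) -> bool:
    """True if a server in a fleet of this size would need more threads than the OS allows."""
    return threads_per_server(fleet_size, baseline_threads) > thread_limit


def max_fleet_size(thread_limit: int, baseline_threads: int = 0) -> int:
    """Largest fleet in which every server still fits under thread_limit."""
    # baseline + (n - 1) <= limit  =>  n <= limit - baseline + 1
    return max(thread_limit - baseline_threads + 1, 0)
```

With a hypothetical limit of 1199 threads and a baseline of 200, `max_fleet_size(1199, 200)` gives 1000, so `exceeds_limit(1000, 1199, 200)` is false and `exceeds_limit(1001, 1199, 200)` is true. Because every server keeps a thread for every peer, they all sit at the same count, which is why one extra batch of capacity can push the whole fleet over together. It also shows why both planned fixes help: fewer, bigger servers shrink the fleet size, and a higher OS limit raises the ceiling.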