A secure approach to generative AI with AWS | Amazon Web Services – AWS Blog
Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. Customers are building generative AI applications using large language models (LLMs) and other foundation models (FMs), which enhance customer experiences, transform operations, improve employee productivity, and create new revenue channels.
FMs and the applications built around them represent extremely valuable investments for our customers. Theyre often used with highly sensitive business data, like personal data, compliance data, operational data, and financial information, to optimize the models output. The biggest concern we hear from customers as they explore the advantages of generative AI is how to protect their highly sensitive data and investments. Because their data and model weights are incredibly valuable, customers require them to stay protected, secure, and private, whether thats from their own administrators accounts, their customers, vulnerabilities in software running in their own environments, or even their cloud service provider from having access.
At AWS, our top priority is safeguarding the security and confidentiality of our customers workloads. We think about security across the three layers of our generative AI stack:
Each layer is important to making generative AI pervasive and transformative.
With the AWS Nitro System, we delivered a first-of-its-kind innovation on behalf of our customers. The Nitro System is an unparalleled computing backbone for AWS, with security and performance at its core. Its specialized hardware and associated firmware are designed to enforce restrictions so that nobody, including anyone in AWS, can access your workloads or data running on your Amazon Elastic Compute Cloud (Amazon EC2) instances. Customers have benefited from this confidentiality and isolation from AWS operators on all Nitro-based EC2 instances since 2017.
By design, there is no mechanism for any Amazon employee to access a Nitro EC2 instance that customers use to run their workloads, or to access data that customers send to a machine learning (ML) accelerator or GPU. This protection applies to all Nitro-based instances, including instances with ML accelerators like AWS Inferentia and AWS Trainium, and instances with GPUs like P4, P5, G5, and G6.
The Nitro System enables Elastic Fabric Adapter (EFA), which uses the AWS-built AWS Scalable Reliable Datagram (SRD) communication protocol for cloud-scale elastic and large-scale distributed training, enabling the only always-encrypted Remote Direct Memory Access (RDMA) capable network. All communication through EFA is encrypted with VPC encryption without incurring any performance penalty.
The design of the Nitro System has been validated by the NCC Group, an independent cybersecurity firm. AWS delivers a high level of protection for customer workloads, and we believe this is the level of security and confidentiality that customers should expect from their cloud provider. This level of protection is so critical that weve added it in our AWS Service Terms to provide an additional assurance to all of our customers.
From day one, AWS AI infrastructure and services have had built-in security and privacy features to give you control over your data. As customers move quickly to implement generative AI in their organizations, you need to know that your data is being handled securely across the AI lifecycle, including data preparation, training, and inferencing. The security of model weightsthe parameters that a model learns during training that are critical for its ability to make predictionsis paramount to protecting your data and maintaining model integrity.
This is why it is critical for AWS to continue to innovate on behalf of our customers to raise the bar on security across each layer of the generative AI stack. To do this, we believe that you must have security and confidentiality built in across each layer of the generative AI stack. You need to be able to secure the infrastructure to train LLMs and other FMs, build securely with tools to run LLMs and other FMs, and run applications that use FMs with built-in security and privacy that you can trust.
At AWS, securing AI infrastructure refers to zero access to sensitive AI data, such as AI model weights and data processed with those models, by any unauthorized person, either at the infrastructure operator or at the customer. Its comprised of three key principles:
The Nitro System fulfills the first principle of Secure AI Infrastructure by isolating your AI data from AWS operators. The second principle provides you with a way to remove administrative access of your own users and software to your AI data. AWS not only offers you a way to achieve that, but we also made it straightforward and practical by investing in building an integrated solution between AWS Nitro Enclaves and AWS Key Management Service (AWS KMS). With Nitro Enclaves and AWS KMS, you can encrypt your sensitive AI data using keys that you own and control, store that data in a location of your choice, and securely transfer the encrypted data to an isolated compute environment for inferencing. Throughout this entire process, the sensitive AI data is encrypted and isolated from your own users and software on your EC2 instance, and AWS operators cannot access this data. Use cases that have benefited from this flow include running LLM inferencing in an enclave. Until today, Nitro Enclaves operate only in the CPU, limiting the potential for larger generative AI models and more complex processing.
We announced our plans to extend this Nitro end-to-end encrypted flow to include first-class integration with ML accelerators and GPUs, fulfilling the third principle. You will be able to decrypt and load sensitive AI data into an ML accelerator for processing while providing isolation from your own operators and verified authenticity of the application used for processing the AI data. Through the Nitro System, you can cryptographically validate your applications to AWS KMS and decrypt data only when the necessary checks pass. This enhancement allows AWS to offer end-to-end encryption for your data as it flows through generative AI workloads.
We plan to offer this end-to-end encrypted flow in the upcoming AWS-designed Trainium2 as well as GPU instances based on NVIDIAs upcoming Blackwell architecture, which both offer secure communications between devices, the third principle of Secure AI Infrastructure. AWS and NVIDIA are collaborating closely to bring a joint solution to market, including NVIDIAs new NVIDIA Blackwell GPU platform, which couples NVIDIAs GB200 NVL72 solution with the Nitro System and EFA technologies to provide an industry-leading solution for securely building and deploying next-generation generative AI applications.
Today, tens of thousands of customers are using AWS to experiment and move transformative generative AI applications into production. Generative AI workloads contain highly valuable and sensitive data that needs the level of protection from your own operators and the cloud service provider. Customers using AWS Nitro-based EC2 instances have received this level of protection and isolation from AWS operators since 2017, when we launched our innovative Nitro System.
At AWS, were continuing that innovation as we invest in building performant and accessible capabilities to make it practical for our customers to secure their generative AI workloads across the three layers of the generative AI stack, so that you can focus on what you do best: building and extending the uses of the generative AI to more areas. Learn more here.
Anthony Liguori is an AWS VP and Distinguished Engineer for EC2
Colm MacCrthaigh is an AWS VP and Distinguished Engineer for EC2
Continued here:
A secure approach to generative AI with AWS | Amazon Web Services - AWS Blog
- Optimization of wear parameters for ECAP-processed ZK30 alloy using response surface and machine learning ... - Nature.com - April 22nd, 2024 [April 22nd, 2024]
- Machine learning approach predicts heart failure outcome risk - HealthITAnalytics.com - April 22nd, 2024 [April 22nd, 2024]
- Practical approaches in evaluating validation and biases of machine learning applied to mobile health studies ... - Nature.com - April 22nd, 2024 [April 22nd, 2024]
- Application of power-law committee machine to combine five machine learning algorithms for enhanced oil recovery ... - Nature.com - April 22nd, 2024 [April 22nd, 2024]
- Free tool uses machine learning to pick better molecules for testing new reactions - Chemical & Engineering News - April 22nd, 2024 [April 22nd, 2024]
- Automated Analysis of Nuclear Parameters in Oral Exfoliative Cytology Using Machine Learning - Cureus - April 22nd, 2024 [April 22nd, 2024]
- An AI Ethics Researcher's Take On The Future Of Machine Learning In The Art World - SlashGear - April 22nd, 2024 [April 22nd, 2024]
- Enhancing Emotion Recognition in Users with Cochlear Implant Through Machine Learning and EEG Analysis - Physician's Weekly - April 22nd, 2024 [April 22nd, 2024]
- Imageomics Applies AI and Vision Advancements to Biological Questions - Photonics.com - April 22nd, 2024 [April 22nd, 2024]
- Machine learning reveals the control mechanics of an insect wing hinge - Nature.com - April 22nd, 2024 [April 22nd, 2024]
- The Future of ML Development Services: Trends and Predictions - FinSMEs - April 22nd, 2024 [April 22nd, 2024]
- CSRWire - Island Conservation Harnesses Machine Learning Solutions From Lenovo and NVIDIA To Restore Island ... - CSRwire.com - April 22nd, 2024 [April 22nd, 2024]
- Investigation of the effectiveness of a classification method based on improved DAE feature extraction for hepatitis C ... - Nature.com - April 22nd, 2024 [April 22nd, 2024]
- Machine Learning Uncovers New Ways to Kill Bacteria With Non-Antibiotic Drugs - ScienceAlert - April 22nd, 2024 [April 22nd, 2024]
- Formal Interaction Model (FIM): A Mathematics-based Machine Learning Model that Formalizes How AI and Users Shape One Another - MarkTechPost - April 22nd, 2024 [April 22nd, 2024]
- Imbalanced Learn: the Python library for rebuilding ML datasets - DataScientest - April 22nd, 2024 [April 22nd, 2024]
- AI has a lot of terms. We've got a glossary for what you need to know - Quartz - April 22nd, 2024 [April 22nd, 2024]
- Texxa AI, Where ideas take flight: Revolutionizing AI Solutions for Businesses and Individuals - GlobeNewswire - April 22nd, 2024 [April 22nd, 2024]
- Using machine learning to identify patients with cancer that would benefit from immunotherapy - Medical Xpress - April 22nd, 2024 [April 22nd, 2024]
- Researchers at Stanford University Explore Direct Preference Optimization (DPO): A New Frontier in Machine Learning and Human Feedback - MarkTechPost - April 22nd, 2024 [April 22nd, 2024]
- Machine Learning Helps Scientists Locate the Neurological Origin of Psychosis - ExtremeTech - April 22nd, 2024 [April 22nd, 2024]
- Slack delivers native and secure generative AI powered by Amazon SageMaker JumpStart | Amazon Web Services - AWS Blog - April 22nd, 2024 [April 22nd, 2024]
- Accurate and rapid antibiotic susceptibility testing using a machine learning-assisted nanomotion technology platform - Nature.com - March 20th, 2024 [March 20th, 2024]
- AI reveals the complexity of a simple birdsong - The Washington Post - March 20th, 2024 [March 20th, 2024]
- Researchers from MIT and Harvard Developed UNITS: A Unified Machine Learning Model for Time Series Analysis that Supports a Universal Task... - March 20th, 2024 [March 20th, 2024]
- Undergraduate Researchers Help Unlock Lessons of Machine Learning and AI - College of Natural Sciences - March 20th, 2024 [March 20th, 2024]
- Machine Learning Accelerates the Simulation of Dynamical Fields - Eos - March 20th, 2024 [March 20th, 2024]
- Inter hospital external validation of interpretable machine learning based triage score for the emergency department ... - Nature.com - March 20th, 2024 [March 20th, 2024]
- HEAL: A framework for health equity assessment of machine learning performance - Google Research - March 20th, 2024 [March 20th, 2024]
- Expert on how machine learning could lead to improved outcomes in urology - Urology Times - March 20th, 2024 [March 20th, 2024]
- Unlock the potential of generative AI in industrial operations | Amazon Web Services - AWS Blog - March 20th, 2024 [March 20th, 2024]
- Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA ... - AWS Blog - March 20th, 2024 [March 20th, 2024]
- Orange isn't building its own AI foundation model here's why - Light Reading - March 20th, 2024 [March 20th, 2024]
- Wall Street's Favorite Machine Learning Stocks? 3 Names That Could Make You Filthy Rich - InvestorPlace - March 20th, 2024 [March 20th, 2024]
- Edge Impulse machine learning platform adds support for NVIDIA TAO Toolkit and Omniverse - CNX Software - March 20th, 2024 [March 20th, 2024]
- MIT Researchers Developed an Image Dataset that Allows Them to Simulate Peripheral Vision in Machine Learning Models - MarkTechPost - March 20th, 2024 [March 20th, 2024]
- 18 Cutting-Edge Artificial Intelligence Applications in 2024 - Simplilearn - March 20th, 2024 [March 20th, 2024]
- Machine-learning-based global optimization of microwave passives with variable-fidelity EM models and response ... - Nature.com - March 20th, 2024 [March 20th, 2024]
- Benchmarking machine learning and parametric methods for genomic prediction of feed efficiency-related traits in ... - Nature.com - March 20th, 2024 [March 20th, 2024]
- PyCaret: Everything you need to know about this Python library - DataScientest - March 20th, 2024 [March 20th, 2024]
- Crypto Entities That Neglect AI and Machine Learning Investment Will Lag Behind, Warns Binance CTO Bitcoin News - Bitcoin.com News - March 20th, 2024 [March 20th, 2024]
- VictoriaMetrics Machine Learning takes monitoring to the next level - The Bakersfield Californian - March 20th, 2024 [March 20th, 2024]
- How Marketers Can Elevate Creative Performance with AI-Driven Format Optimisation - ExchangeWire - March 20th, 2024 [March 20th, 2024]
- Revolutionizing carbon neutrality: Machine learning paves the way for advanced CO reduction catalysts - EurekAlert - March 20th, 2024 [March 20th, 2024]
- BurstAttention: A Groundbreaking Machine Learning Framework that Transforms Efficiency in Large Language Models with Advanced Distributed Attention... - March 20th, 2024 [March 20th, 2024]
- Construction of environmental vibration prediction model for subway transportation based on machine learning ... - Nature.com - March 20th, 2024 [March 20th, 2024]
- Introducing 'Get started with generative AI on AWS: A guide for public sector organizations' | Amazon Web Services - AWS Blog - March 20th, 2024 [March 20th, 2024]
- Generative deep learning for the development of a type 1 diabetes simulator | Communications Medicine - Nature.com - March 20th, 2024 [March 20th, 2024]
- Integrating core physics and machine learning for improved parameter prediction in boiling water reactor operations ... - Nature.com - March 11th, 2024 [March 11th, 2024]
- Top AI Certification Courses to Enroll in 2024 - Analytics Insight - March 11th, 2024 [March 11th, 2024]
- Machine learning techniques applied to construction: A hybrid bibliometric analysis of advances and future directions - ScienceDirect.com - March 11th, 2024 [March 11th, 2024]
- Artificial Intelligence Market towards a USD 2,745 bn by 2032 - Market.us Scoop - Market News - March 11th, 2024 [March 11th, 2024]
- Data Maturation Represents the Essential Reason for Deploying Machine Learning Today | By Adam Mogelonsky - Hospitality Net - March 11th, 2024 [March 11th, 2024]
- The Top 3 Machine Learning Stocks to Buy in March 2024 - InvestorPlace - March 11th, 2024 [March 11th, 2024]
- How to Learn the Math Needed for Data Science - Towards Data Science - March 11th, 2024 [March 11th, 2024]
- This AI Paper from Huawei Introduces DenseSSM: A Novel Machine Learning Approach to Enhance the Flow of Hidden Information between Layers in State... - March 11th, 2024 [March 11th, 2024]
- Machine learning and the prediction of suicide in psychiatric populations: a systematic review | Translational Psychiatry - Nature.com - March 11th, 2024 [March 11th, 2024]
- Machine learning algorithms show applications in OAB, antibiotic resistance - Urology Times - March 11th, 2024 [March 11th, 2024]
- Scientists develop new machine learning method for modeling chemical reactions - Phys.org - March 11th, 2024 [March 11th, 2024]
- Machine learning developed a CD8+ exhausted T cells signature for predicting prognosis, immune infiltration and drug ... - Nature.com - March 11th, 2024 [March 11th, 2024]
- Single Transit Detection In Kepler With Machine Learning And Onboard Spacecraft Diagnostics - Astrobiology - Astrobiology News - March 11th, 2024 [March 11th, 2024]
- Meta AI Proposes Wukong: A New Machine Learning Architecture that Exhibits Effective Dense Scaling Properties Towards a Scaling Law for Large-Scale... - March 11th, 2024 [March 11th, 2024]
- Putting the AI in NIA: New opportunities in artificial intelligence - National Institute on Aging - March 11th, 2024 [March 11th, 2024]
- Revolutionizing LLM Training with GaLore: A New Machine Learning Approach to Enhance Memory Efficiency without Compromising Performance - MarkTechPost - March 11th, 2024 [March 11th, 2024]
- Uncertainty-aware deep learning for trustworthy prediction of long-term outcome after endovascular thrombectomy ... - Nature.com - March 11th, 2024 [March 11th, 2024]
- AI Engineer Salary: The Lucrative World of AI Engineering - Simplilearn - March 11th, 2024 [March 11th, 2024]
- Multimodal artificial intelligence-based pathogenomics improves survival prediction in oral squamous cell carcinoma ... - Nature.com - March 11th, 2024 [March 11th, 2024]
- Northrop Grumman Partners to Advance Deep Sensing for the US Army | Northrop Grumman - Northrop Grumman Newsroom - March 11th, 2024 [March 11th, 2024]
- Global cellular IoT connections to grow 90% to 6.5 bn by 2028: Juniper Research - ETTelecom - March 11th, 2024 [March 11th, 2024]
- Enhancing statistical reliability of weather forecasts with machine learning - Phys.org - March 11th, 2024 [March 11th, 2024]
- Inside AI: Talking to the Data - Inside Unmanned Systems - March 11th, 2024 [March 11th, 2024]
- Anemond's Factoid 2 is an experimental sampler plugin that uses machine learning to "decompose", remix and ... - MusicRadar - March 11th, 2024 [March 11th, 2024]
- Advancing Chemistry with AI: New Model for Simulating Diverse Organic Reactions - Lab Manager Magazine - March 11th, 2024 [March 11th, 2024]
- Generative AI: Understand the challenges to realize the opportunities | Amazon Web Services - AWS Blog - March 11th, 2024 [March 11th, 2024]
- How To Specialize in Artificial Intelligence - Troy Today - Troy University - March 11th, 2024 [March 11th, 2024]
- Google DeepMind Introduces Two Unique Machine Learning Models, Hawk And Griffin, Combining Gated Linear Recurrences With Local Attention For Efficient... - March 11th, 2024 [March 11th, 2024]
- Unlocking Innovation: AWS and Anthropic push the boundaries of generative AI together | Amazon Web Services - AWS Blog - March 11th, 2024 [March 11th, 2024]
- Introducing Microsoft's AI Red Team And PyRIT - AiThority - March 11th, 2024 [March 11th, 2024]
- Unveiling the World of Artificial Intelligence: A Beginner's Guide - Medium - January 3rd, 2024 [January 3rd, 2024]
- How machine learning might unlock earthquake prediction - MIT Technology Review - January 3rd, 2024 [January 3rd, 2024]
Tags: