How to minimize data risk for generative AI and LLMs in the enterprise – VentureBeat
Head over to our on-demand library to view sessions from VB Transform 2023. Register Here
Enterprises have quickly recognized the power of generative AI to uncover new ideas and increase both developer and non-developer productivity. But pushing sensitive and proprietary data into publicly hosted large language models (LLMs) creates significant risks in security, privacy and governance. Businesses need to address these risks before they can start to see any benefit from these powerful new technologies.
As IDC notes, enterprises have legitimate concerns that LLMs may learn from their prompts and disclose proprietary information to other businesses that enter similar prompts. Businesses also worry that any sensitive data they share could be stored online and exposed to hackers or accidentally made public.
That makes feeding data and prompts into publicly hosted LLMs a nonstarter for most enterprises, especially those operating in regulated spaces. So, how can companies extract value from LLMs while sufficiently mitigating the risks?
Instead of sending your data out to an LLM, bring the LLM to your data. This is the model most enterprises will use to balance the need for innovation with the importance of keeping customer PII and other sensitive data secure. Most large businesses already maintain a strong security and governance boundary around their data, and they should host and deploy LLMs within that protected environment. This allows data teams to further develop and customize the LLM and employees to interact with it, all within the organizations existing security perimeter.
VB Transform 2023 On-Demand
Did you miss a session from VB Transform 2023? Register to access the on-demand library for all of our featured sessions.
A strong AI strategy requires a strong data strategy to begin with. That means eliminating silos and establishing simple, consistent policies that allow teams to access the data they need within a strong security and governance posture. The end goal is to have actionable, trustworthy data that can be accessed easily to use with an LLM within a secure and governed environment.
LLMs trained on the entire web present more than just privacy challenges. Theyre prone to hallucinations and other inaccuracies and can reproduce biases and generate offensive responses that create further risk for businesses. Moreover, foundational LLMs have not been exposed to your organizations internal systems and data, meaning they cant answer questions specific to your business, your customers and possibly even your industry.
The answer is to extend and customize a model to make it smart about your own business. While hosted models like ChatGPT have gotten most of the attention, there is a long and growing list of LLMs that enterprises can download, customize, and use behind the firewall including open-source models like StarCoder from Hugging Face and StableLM from Stability AI. Tuning a foundational model on the entire web requires vast amounts of data and computing power, but as IDC notes, once a generative model is trained, it can be fine-tuned for a particular content domain with much less data.
An LLM doesnt need to be vast to be useful. Garbage in, garbage out is true for any AI model, and enterprises should customize models using internal data that they know they can trust and that will provide the insights they need. Your employees probably dont need to ask your LLM how to make a quiche or for Fathers Day gift ideas. But they may want to ask about sales in the Northwest region or the benefits a particular customers contract includes. Those answers will come from tuning the LLM on your own data in a secure and governed environment.
In addition to higher-quality results, optimizing LLMs for your organization can help reduce resource needs. Smaller models targeting specific use cases in the enterprise tend to require less compute power and smaller memory sizes than models built for general-purpose use cases or a large variety of enterprise use cases across different verticals and industries. Making LLMs more targeted for use cases in your organization will help you run LLMs in a more cost-effective, efficient way.
Tuning a model on your internal systems and data requires access to all the information that may be useful for that purpose, and much of this will be stored in formats besides text. About 80% of the worlds data is unstructured, including company data such as emails, images, contracts and training videos.
That requires technologies like natural language processing to extract information from unstructured sources and make it available to your data scientists so they can build and train multimodal AI models that can spot relationships between different types of data and surface these insights for your business.
This is a fast-moving area, and businesses must use caution with whatever approach they take to generative AI. That means reading the fine print about the models and services they use and working with reputable vendors that offer explicit guarantees about the models they provide. But its an area where companies cannot afford to stand still, and every business should be exploring how AI can disrupt its industry. Theres a balance that must be struck between risk and reward, and by bringing generative AI models close to your data and working within your existing security perimeter, youre more likely to reap the opportunities that this new technology brings.
Torsten Grabs is senior director of product management at Snowflake.
Welcome to the VentureBeat community!
DataDecisionMakers is where experts, including the technical people doing data work, can share data-related insights and innovation.
If you want to read about cutting-edge ideas and up-to-date information, best practices, and the future of data and data tech, join us at DataDecisionMakers.
You might even considercontributing an articleof your own!
Read More From DataDecisionMakers
Continue reading here:
How to minimize data risk for generative AI and LLMs in the enterprise - VentureBeat
- Debate over future of US AI regulation hinges on broadband funding - Reuters - June 26th, 2025 [June 26th, 2025]
- Forget about AI costs: Google just changed the game with open-source Gemini CLI that will be free for most developers - VentureBeat - June 26th, 2025 [June 26th, 2025]
- How ChatGPT and other AI tools are changing the teaching profession - AP News - June 26th, 2025 [June 26th, 2025]
- AI valuations are verging on the unhinged - The Economist - June 26th, 2025 [June 26th, 2025]
- Newly minted PhDs in AI nabbing six- and seven-figure paydays - Fortune - June 26th, 2025 [June 26th, 2025]
- Ring debuts Video Descriptions, Gen AI-powered updates on whats happening at home - AboutAmazon.com - June 26th, 2025 [June 26th, 2025]
- AI Regulations: Lawmaker Says Ban on State AI Rules Will Survive in Some Version in Budget Bill - PYMNTS.com - June 26th, 2025 [June 26th, 2025]
- Blacklisted by the U.S. and backed by Beijing, this Chinese AI startup has caught OpenAI's attention - CNBC - June 26th, 2025 [June 26th, 2025]
- 15 new jobs AI is creating - including 'Synthetic reality producer' - ZDNET - June 26th, 2025 [June 26th, 2025]
- Ohio man used AI-generated porn to harass exes and their moms, prosecutors say - The Columbus Dispatch - June 26th, 2025 [June 26th, 2025]
- Over 40% of agentic AI projects will be scrapped by 2027, Gartner says - Reuters - June 26th, 2025 [June 26th, 2025]
- Flood of AI-generated resumes causes chaos for recruiters, who resort to AI to screen them - Mashable - June 26th, 2025 [June 26th, 2025]
- And Now Malware That Tells AI to Ignore It? - Dark Reading - June 26th, 2025 [June 26th, 2025]
- Walmart unveils new AI tools for workers. Here's what they'll do. - USA Today - June 26th, 2025 [June 26th, 2025]
- Meet Project Rainier, Amazons one-of-a-kind machine ushering in the next generation of AI - AboutAmazon.com - June 26th, 2025 [June 26th, 2025]
- NHL AI mock draft: AI predicts the first round of the 2025 NHL Draft - USA Today - June 26th, 2025 [June 26th, 2025]
- Anthropic destroyed millions of print books to build its AI models - Ars Technica - June 26th, 2025 [June 26th, 2025]
- Satya Nadella: The hardest part of AI isn't the tech. It's getting people to change how they work. - Business Insider - June 26th, 2025 [June 26th, 2025]
- Microsoft sued by authors over use of books in AI training - Reuters - June 26th, 2025 [June 26th, 2025]
- Sitchs new dating app fuses human matchmaking and AI - TechCrunch - June 26th, 2025 [June 26th, 2025]
- Japanese company using mee-AI-ow to detect stressed cats - theregister.com - June 26th, 2025 [June 26th, 2025]
- Hertz Is Using AI to Scan Your Rental Car for Damage, and It Might Cost You - Car and Driver - June 26th, 2025 [June 26th, 2025]
- Bipartisan bill seeks to ban Chinese AI from federal agencies, as U.S. vows to win the AI race - ABC News - Breaking News, Latest News and Videos - June 26th, 2025 [June 26th, 2025]
- AI Agents Are Getting Better at Writing Codeand Hacking It as Well - WIRED - June 26th, 2025 [June 26th, 2025]
- Rubrik to Acquire Predibase to Accelerate Agentic AI Adoption - Business Wire - June 26th, 2025 [June 26th, 2025]
- IBM sees enterprise customers are using 'everything' when it comes to AI, the challenge is matching the LLM to the right use case - VentureBeat - June 26th, 2025 [June 26th, 2025]
- Hundreds of MCP Servers Expose AI Models to Abuse, RCE - Dark Reading - June 26th, 2025 [June 26th, 2025]
- Amazon's Ring can now use AI to 'learn the routines of your residence' - theregister.com - June 26th, 2025 [June 26th, 2025]
- Apple Will Need to Leave Its M&A Comfort Zone to Succeed in AI - Bloomberg.com - June 24th, 2025 [June 24th, 2025]
- An AI video ad is making a splash. Is it the future of advertising? - NPR - June 24th, 2025 [June 24th, 2025]
- Should consumers and businesses use AI assistants? - Brookings - June 24th, 2025 [June 24th, 2025]
- I asked AI, Google Flights and a travel agent to find me the cheapest flight. Heres who won. - MarketWatch - June 24th, 2025 [June 24th, 2025]
- NotebookLM Is Still the Best AI Tool You're Missing Out On - CNET - June 24th, 2025 [June 24th, 2025]
- Meta Held Deal Talks With Startup Runway in AI Recruiting Push - Bloomberg.com - June 24th, 2025 [June 24th, 2025]
- The rise of the personal AI advisors - Fast Company - June 24th, 2025 [June 24th, 2025]
- OpenAIs first AI device with Jony Ive wont be a wearable - The Verge - June 24th, 2025 [June 24th, 2025]
- Court filings reveal OpenAI and ios early work on an AI device - TechCrunch - June 24th, 2025 [June 24th, 2025]
- MrBeast used AI to create YouTube thumbnails. People werent pleased - Fast Company - June 24th, 2025 [June 24th, 2025]
- AI is coming to the NFL, and it could transform the game - The New York Times - June 24th, 2025 [June 24th, 2025]
- Amazon to Invest Around $54 Billion in U.K. to Support Innovation, AI Push - WSJ - June 24th, 2025 [June 24th, 2025]
- This theory about Jony Ives AI hardware device seems increasingly likely - 9to5Mac - June 24th, 2025 [June 24th, 2025]
- MAGA Is Split Over the AI Provision in Trump's Big Beautiful Bill - Business Insider - June 24th, 2025 [June 24th, 2025]
- 5 Dividend Stocks Poised to Profit From the AI Efficiency Boom - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- Here are the overlooked ways to play AI, crypto and quantum trends, says this tech investor - MarketWatch - June 24th, 2025 [June 24th, 2025]
- Microsoft to Cut Thousands of Jobs as AI Spending Surges - Yahoo Finance - June 24th, 2025 [June 24th, 2025]
- The Oversight Board calls Meta's uneven AI moderation 'incoherent and unjustifiable' - Engadget - June 24th, 2025 [June 24th, 2025]
- 3 Phenomenal AI Stocks That Investors Should Load Up On - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- Stock-Split Watch: Is This AI Stock That's Soared 300% Next on the List? - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- I Asked ChatGPT To Explain How To Make Money Using AI Heres What It Said - Nasdaq - June 24th, 2025 [June 24th, 2025]
- 2 Top AI Stocks to Sell Before They Fall 57% and 8%, According to These Wall Street Analysts - The Motley Fool - June 24th, 2025 [June 24th, 2025]
- AI's impact on the job market is inevitable, says workforce expert: 'It's going to hurt for certain parts of the population' - CNBC - June 24th, 2025 [June 24th, 2025]
- Leading AI models show up to 96% blackmail rate when their goals or existence is threatened, an Anthropic study says - Fortune - June 24th, 2025 [June 24th, 2025]
- Voters beware: 25 states restrict AI in elections. SC is in the other half. - News From The States - June 24th, 2025 [June 24th, 2025]
- Sphere Brings Its AI-Powered Mixed Reality to Vuzix Smart Glasses - Morningstar - June 24th, 2025 [June 24th, 2025]
- I've used Perplexity here's why it could be the perfect solution to Apples AI conundrum - TechRadar - June 24th, 2025 [June 24th, 2025]
- Opinion: Forget the Magnificent Seven these 7 cheap tech and AI stocks are better buys right now - MarketWatch - June 24th, 2025 [June 24th, 2025]
- Law firm says attorneys use of AI was isolated event - News From The States - June 24th, 2025 [June 24th, 2025]
- The cofounder of the viral AI 'cheating' startup Cluely says he only hires people for 2 jobs - Business Insider - June 24th, 2025 [June 24th, 2025]
- AI Is Power-Hungry, but It Could Eventually Cut More Emissions Than It Creates - Scientific American - June 24th, 2025 [June 24th, 2025]
- AI is about to change everything, including how we date. - Psychology Today - June 24th, 2025 [June 24th, 2025]
- Malicious AI willing to sacrifice human lives to avoid being shut down, shocking study reveals - New York Post - June 24th, 2025 [June 24th, 2025]
- Entrepreneur and investor Gary Vee's top tips to use and embrace AI - Fortune - June 24th, 2025 [June 24th, 2025]
- 5 things TV and movies promised AI can do that it can't yet - TechRadar - June 24th, 2025 [June 24th, 2025]
- Seattle to deploy AI to speed up housing and small business permit process - GeekWire - June 24th, 2025 [June 24th, 2025]
- AI-based brain-mapping software receives FDA market authorization - WashU Medicine - June 24th, 2025 [June 24th, 2025]
- Message from CEO Andy Jassy: Some thoughts on Generative AI - AboutAmazon.com - June 22nd, 2025 [June 22nd, 2025]
- Surge AI, the Hot Tech Startup Youve Probably Never Heard of, Is Already Outpacing Rivals - Inc.com - June 22nd, 2025 [June 22nd, 2025]
- Prediction: This Artificial Intelligence (AI) Data Center Stock Will Be Worth More Than Palantir by 2030 - Yahoo Finance - June 22nd, 2025 [June 22nd, 2025]
- Applebees and IHOP Plan to Introduce AI in Restaurants - WSJ - June 22nd, 2025 [June 22nd, 2025]
- 2 Artificial Intelligence (AI) Stocks That Could Soar in the Second Half of 2025 - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]
- BBC threatens AI firm with legal action over unauthorised content use - BBC - June 22nd, 2025 [June 22nd, 2025]
- Chevron and Exxon Are the Next Hot AI Stocks. Heres Why. - Barron's - June 22nd, 2025 [June 22nd, 2025]
- Exclusive: Nvidia, Foxconn in talks to deploy humanoid robots at Houston AI server making plant - Reuters - June 22nd, 2025 [June 22nd, 2025]
- Bosses want you to know AI is coming for your job - The Washington Post - June 22nd, 2025 [June 22nd, 2025]
- Meta partners with sports eyewear brand Oakley to launch AI-powered glasses - Reuters - June 22nd, 2025 [June 22nd, 2025]
- Apple Executives Have Held Internal Talks About Buying AI Startup Perplexity - Bloomberg.com - June 22nd, 2025 [June 22nd, 2025]
- What Are the 5 Best Bargain Artificial Intelligence (AI) Stocks to Buy Right Now? - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]
- Intel will outsource marketing to Accenture and AI, laying off many of its own workers - OregonLive.com - June 22nd, 2025 [June 22nd, 2025]
- I made an AI tool to run my job search, and it helped me get my dream role - Business Insider - June 22nd, 2025 [June 22nd, 2025]
- 1 AI Super Stock Is Starting to Rebound, but Shares Still Look Cheap - The Motley Fool - June 22nd, 2025 [June 22nd, 2025]