The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AIM Intelligence and BMW Group Examine Gaps in Evaluating Enterprise AI Policy Compliance

Research reveals LLMs follow allowlist policies but systematically fail to enforce organizational prohibitions, exposing a critical gap in enterprise AI safety

SF, CA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Seoul, South Korea / Munich, Germany – January 2026 – BMW Group and AIM Intelligence, a leading AI safety startup, today announced the publication of COMPASS (Company/Organization Policy Alignment Assessment), the first systematic framework for evaluating whether large language models (LLMs) comply with organization-specific policies. The research, now available on arXiv, reveals a critical gap that remains under-measured in current evaluation practices: models that pass standard safety benchmarks often fail dramatically when enforcing the nuanced, context-dependent rules that govern real-world business operations.

Why Enterprise AI Policies Break Down in Practice

As organizations across healthcare, finance, automotive, and government sectors rapidly adopt LLMs for customer-facing applications, the research team discovered a fundamental asymmetry that poses significant risks for policy-critical deployments.
Key Findings:
Strong Allowlist Compliance: Models reliably handle legitimate requests with over 95% accuracy
Critical Denylist Failures: Models fail to correctly refuse prohibited requests in up to 97% of cases
Catastrophic Adversarial Vulnerability: Under adversarial conditions, some models refuse fewer than 5% of policy-violating requests
“Most AI safety tests focus on whether a model behaves safely in general,” said Dasol Choi, AI Safety Researcher at AIM Intelligence. “COMPASS looks at a more practical question: can an AI system reliably follow the specific rules of an organization? Our findings show that, in many real-world deployments today, the answer is often no.”

Why Generic AI Safety Isn’t Enough

The research addresses a critical disconnect between how AI systems are evaluated and how they are deployed. While existing safety benchmarks focus on universal harms such as toxicity and violence, real enterprises operate under complex internal policies—compliance manuals, operational playbooks, legal edge cases, and brand-specific constraints.
COMPASS evaluates models across four dimensions that typical benchmarks ignore:
1. Policy Selection: Can the model identify which policy applies to a given situation?
2. Policy Interpretation: Can it reason through conditionals, exceptions, and vague clauses?
3. Conflict Resolution: When rules collide, does the model resolve conflicts as the organization intends?
4. Justification: Can the model ground its decisions in actual policy text?

“Our evaluation revealed a striking asymmetry,” noted DongGeon Lee, AI Safety Researcher at AIM Intelligence. “While models achieve near-perfect accuracy on what they can do, they remain structurally vulnerable in enforcing what they must not do. This gap persists across model scales and architectures, indicating that scaling alone cannot solve the problem.”

Industry-Scale Validation

The research team applied COMPASS across eight diverse industry scenarios—Automotive, Government, Financial, Healthcare, Travel, Telecom, Education, and Recruiting—generating and validating 5,920 queries that test both routine compliance and adversarial robustness. Fifteen state-of-the-art models were evaluated, including leading proprietary and open-source systems.

Making Misalignment Measurable

Perhaps the most significant contribution of COMPASS is transforming alignment from a philosophical concern into an engineering problem. The framework and benchmark datasets are publicly available on GitHub and Hugging Face, enabling organizations to evaluate their AI systems against their own policies.

About the Research Collaboration

This research represents a collaboration between AIM Intelligence, BMW Group, Yonsei University, Pohang University of Science and Technology, and Seoul National University. The full paper, “COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs,” is available at https://arxiv.org/abs/2601.01836.

About AIM Intelligence

AIM Intelligence is a Seoul-based AI safety company specializing in automated red-teaming, real-time guardrails, and AI monitoring solutions. Founded in 2024, AIM Intelligence serves major enterprises and conducts research across large language models, multimodal systems, autonomous agents, and emerging physical AI. The company has published over 15 research papers at top-tier conferences including ICML, ACL, NeurIPS, and IEEE.

Team Cookie Official
Team Cookie
email us here
Visit us on social media:
LinkedIn
Facebook

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Willow Ash Roofing Announces Expanded Metal Roofing Services in Mount Pleasant, SC

Willow Ash Roofing Announces Expanded Metal Roofing Services in Mount Pleasant, SC

Mt Pleasant, SC – Willow Ash Roofing, a leading roofing contractor in the Charleston and Mt Pleasant area, is excited

February 20, 2026

Exercise & Cognitive Performance: Why Physical Activity Helps the ADD Brain

Exercise & Cognitive Performance: Why Physical Activity Helps the ADD Brain

Physical activity supports the same brain systems targeted in clinical treatment”— Dr. Stanford Owen GULFPORT, LA,

February 20, 2026

Sandy Rowley Expands Nationally, Offering AI-Enhanced SEO Services at Deep Discounts for Small Local Service Businesses

Sandy Rowley Expands Nationally, Offering AI-Enhanced SEO Services at Deep Discounts for Small Local Service Businesses

Sandy Rowley expands AI-powered SEO nationwide, offering deeply discounted marketing services for local contractors,

February 20, 2026

Treemendous Tree Care LLC Announces Expanding Stump Grinding Services in Mount Clemens, MI

Treemendous Tree Care LLC Announces Expanding Stump Grinding Services in Mount Clemens, MI

Clinton Township, MI – Treemendous Tree Care LLC, a local arborist serving Southeast Michigan with excellent tree

February 20, 2026

Frogtown Roofing Plus Announces Expanded Roof Repair Services in Toledo, OH

Frogtown Roofing Plus Announces Expanded Roof Repair Services in Toledo, OH

Maumee, Ohio – Frogtown Roofing Plus, a professional and licensed roofing company, is excited to announce it’s

February 20, 2026

Longtree Tree Service Announces Expanded Stump Grinding Services in Farmington, MI

Longtree Tree Service Announces Expanded Stump Grinding Services in Farmington, MI

Southfield, MI – Longtree Tree Service, a leading tree care and arborist company, is happy to announce it’s expanding

February 20, 2026

Islands In The Sun BBQ Announces ‘Top Deals of the Year Blowout’

Islands In The Sun BBQ Announces ‘Top Deals of the Year Blowout’

Canyon Lake, CA – Islands In The Sun BBQ, a leading online store specialising in premium outdoor kitchens and grills,

February 20, 2026

Skills-Based Hiring Takes Center Stage: Whitman Associates Expands Placement Strategies for 2026

Skills-Based Hiring Takes Center Stage: Whitman Associates Expands Placement Strategies for 2026

Whitman Associates shifts to skills-based hiring, prioritizing experience over credentials as the staffing industry

February 20, 2026

Adams Pool Solutions Expands Commercial Pool Construction Division to Meet Growing Regional Demand

Adams Pool Solutions Expands Commercial Pool Construction Division to Meet Growing Regional Demand

PLEASANTON, CA – February 13, 2026 – PRESSADVANTAGE – Adams Pool Solutions has announced the expansion of its

February 20, 2026

Composite Bonding East Dulwich Cosmetic Dentist Dr Mori Shahid Recommends Treatments at The Gardens Dental Centre (Smile 4 U)

Composite Bonding East Dulwich Cosmetic Dentist Dr Mori Shahid Recommends Treatments at The Gardens Dental Centre (Smile 4 U)

London, England – February 13, 2026 – PRESSADVANTAGE – The Gardens Dental Centre (Smile 4 U) has announced the

February 20, 2026

Digital Data for Resilience: Dmitry Erokhin Shows How Online Data Strengthen Crisis Communication and Climate Adaptation

Digital Data for Resilience: Dmitry Erokhin Shows How Online Data Strengthen Crisis Communication and Climate Adaptation

Dmitry Erokhin at IIASA Laxenburg Austria shows how search and social media signals can strengthen crisis

February 20, 2026

FullNet Communications Declares Quarterly Cash Dividend

FullNet Communications Declares Quarterly Cash Dividend

FullNet Communications’ Board of Directors Approves 12.2% Increase in Quarterly Cash Dividend Under Its Quarterly Cash

February 20, 2026

Jacaruso Launches Lead Shark, Hotel-Specific AI Sales Intelligence

Jacaruso Launches Lead Shark, Hotel-Specific AI Sales Intelligence

AUSTIN, TEXAS, TX, UNITED STATES, February 13, 2026 /EINPresswire.com/ — Jacaruso Enterprises announces the launch of

February 20, 2026

Law Office of Jason M. Hatfield Attorney Lauri Thomas Secures 50 Percent Wage-Loss Award in Arkansas Workers’ Compensation Case

Law Office of Jason M. Hatfield Attorney Lauri Thomas Secures 50 Percent Wage-Loss Award in Arkansas Workers’ Compensation Case

Springdale, Arkansas – The Law Office of Jason M. Hatfield announced that attorney Lauri Thomas has obtained a

February 20, 2026

McCready Law Welcomes Trial Lawyer Donald R. McGarrah as Partner

McCready Law Welcomes Trial Lawyer Donald R. McGarrah as Partner

Chicago, Illinois – McCready Law today announced that trial lawyer Donald R. “Don” McGarrah has joined the firm as a

February 20, 2026

RestoPros of Southern New Hampshire Expands Emergency Restoration Services Across Region

RestoPros of Southern New Hampshire Expands Emergency Restoration Services Across Region

CANTERBURY, NH – February 13, 2026 – PRESSADVANTAGE – RestoPros of Southern New Hampshire has expanded its emergency

February 20, 2026

Mindmachines.com Advances Mind Technology with Enhanced pROSHI Protocols for Meditation Device

Mindmachines.com Advances Mind Technology with Enhanced pROSHI Protocols for Meditation Device

Dallas, Texas – February 13, 2026 – PRESSADVANTAGE – Mindmachines.com has expanded the capabilities of its ROSHIwave

February 20, 2026

TaxFree RV Highlights Montana Registration Strategy as California Vehicle Owners Face Rising Tax Burdens

TaxFree RV Highlights Montana Registration Strategy as California Vehicle Owners Face Rising Tax Burdens

RED LODGE, MT – February 13, 2026 – PRESSADVANTAGE – TaxFree RV, a vehicle registration specialist operating since

February 20, 2026

Amana Care Clinic Announces Enhanced Walk-In Medical Services for Muscatine Residents

Amana Care Clinic Announces Enhanced Walk-In Medical Services for Muscatine Residents

MUSCATINE, Iowa – February 13, 2026 – PRESSADVANTAGE – Amana Care Clinic – Muscatine has announced expanded walk-in

February 20, 2026

SASGOG Selects Momentum Association Management as New AMC Partner

SASGOG Selects Momentum Association Management as New AMC Partner

The Society for Academic Specialists in General Obstetrics and Gynecology (SASGOG) has selected Momentum Association

February 20, 2026

Radiant Autism Center Co-Founder Bobby Whitney Joins Board of The Owen Foundation

Radiant Autism Center Co-Founder Bobby Whitney Joins Board of The Owen Foundation

Radiant Autism Center co-founder Bobby Whitney joins The Owen Foundation board to expand autism advocacy and family

February 20, 2026

Bonsai Marketing Expands AI-Powered Hyper-Local Growth Platform to Help Sonoma County Businesses Dominate Search in 2026

Bonsai Marketing Expands AI-Powered Hyper-Local Growth Platform to Help Sonoma County Businesses Dominate Search in 2026

Bonsai Marketing expands its AI-powered hyper-local platform to help Sonoma County businesses dominate local search and

February 20, 2026

Backyard Banger to Showcase World’s First Garden Hose Kitchen & Wet Bar on Wheels at The Colorado Garden & Home Show

Backyard Banger to Showcase World’s First Garden Hose Kitchen & Wet Bar on Wheels at The Colorado Garden & Home Show

"Whether it's deployments, birthdays, graduations, football games, you name it, America grills in the backyard," Ty

February 20, 2026

Immersive Leadership Model Accelerates Growth Through 40 Short, Powerful ‘Moments’

Immersive Leadership Model Accelerates Growth Through 40 Short, Powerful ‘Moments’

Executive coach and bestselling author Scott Abbott's new interactive resource is designed to inspire and recharge

February 20, 2026

FLEET DATA CENTERS ANNOUNCES PRICING OF $3.8 BILLION OF SENIOR SECURED NOTES FOR HYPERSCALE FACILITY IN GROWING RENO HUB

FLEET DATA CENTERS ANNOUNCES PRICING OF $3.8 BILLION OF SENIOR SECURED NOTES FOR HYPERSCALE FACILITY IN GROWING RENO HUB

Fleet Data Centers announces pricing of $3.8 Billion of senior secured notes for Hyperscale facility in rapidly growing

February 20, 2026

Grief, Resilience, and Redemption: ’Losing Michele’ Earns Best Seller Distinction Following Widespread Praise

Grief, Resilience, and Redemption: ’Losing Michele’ Earns Best Seller Distinction Following Widespread Praise

In a powerful testament to the healing power of storytelling, author Alicia Trew’s deeply personal memoir, has

February 20, 2026

Bentley Rancho Mirage Celebrates Global Launch Of New Bentley Continental GT S And GTC S

Bentley Rancho Mirage Celebrates Global Launch Of New Bentley Continental GT S And GTC S

We’re thrilled to bring these remarkable vehicles to the Coachella Valley and to our clients who expect the very best

February 20, 2026

Darlene Zschech Releases ’Shout to the Lord (All The Earth)’ with Ingrid Rosario and Ana Paula Valadão

Darlene Zschech Releases ’Shout to the Lord (All The Earth)’ with Ingrid Rosario and Ana Paula Valadão

Australian worship leader Darlene Zschech releases a new version, “Shout To The Lord (All The Earth),” with Ana Paula

February 20, 2026

Turnaround Management Association Appoints Christine Melendes as Chief Executive Officer

Turnaround Management Association Appoints Christine Melendes as Chief Executive Officer

TMA, the leading organization dedicated to corporate restructuring and renewal has appointed Christine Melendes as

February 20, 2026

Nonprofit Alliance of Consumer Advocates and Consumer Defense Law Group Delivers $95,002 Reduction to Stop Foreclosure

Nonprofit Alliance of Consumer Advocates and Consumer Defense Law Group Delivers $95,002 Reduction to Stop Foreclosure

LA CAñADA FLINTRIDGE, CA, UNITED STATES, February 13, 2026 /EINPresswire.com/ — The Nonprofit Alliance of Consumer

February 20, 2026

From Destiny’s Child ‘Survivor’ to Real-Life Survivor: Beyoncé’s Father Dr. Mathew Knowles Wins Humanitarian Honor

From Destiny’s Child ‘Survivor’ to Real-Life Survivor: Beyoncé’s Father Dr. Mathew Knowles Wins Humanitarian Honor

Time4Sharing.org recognizes Dr. Mathew Knowles for his advocacy and leadership in support of children and early cancer

February 20, 2026

PCI Race Radios Announces Partnership with Robby Gordon and Max Gordon

PCI Race Radios Announces Partnership with Robby Gordon and Max Gordon

PCI Provides Premium In-Car Communications and Fresh-Air Systems as Gordons Gear Up for 2026 Season Just got hooked up

February 20, 2026

Woven Creative Team Honored with 2026 Pharma Choice Awards

Woven Creative Team Honored with 2026 Pharma Choice Awards

Industry professionals recognize excellence in branding and public health awareness. Our North Star is always the

February 20, 2026

Resurgence Releases New Website Resource Examining Metaxalone Use and Its Role in Addiction Treatment

Resurgence Releases New Website Resource Examining Metaxalone Use and Its Role in Addiction Treatment

JURUPA VALLEY, CA – February 13, 2026 – PRESSADVANTAGE – A newly released educational resource provides clinically

February 20, 2026

Silverback Webinar Announces Refined Webinar Platform to Support Modern Digital Communication Needs

Silverback Webinar Announces Refined Webinar Platform to Support Modern Digital Communication Needs

February 13, 2026 – PRESSADVANTAGE – Silverback Webinar has announced an updated webinar platform designed to address

February 20, 2026

Dr. Antonia Maioni Named President of John Cabot University

Dr. Antonia Maioni Named President of John Cabot University

Canadian born scholar succeeds Dr Franco Pavoncello as Head of Rome based U.S. University We believe that Dr Maioni

February 20, 2026

Toborlife AI Launches Unitree H2 Humanoid Robot Pre-Order

Toborlife AI Launches Unitree H2 Humanoid Robot Pre-Order

Toborlife AI announces the upcoming availability of the Unitree H2 humanoid robot in the North American market.

February 20, 2026

Healthcare Innovation Reaches Bradenton as Mosaic Medicine Introduces Membership Based Family Care

Healthcare Innovation Reaches Bradenton as Mosaic Medicine Introduces Membership Based Family Care

Mosaic Medicine Clinic delivers direct primary care for families with transparent pricing, longer visits, and

February 20, 2026

Another Surplus Trustee Sale Reversal: N.A.C.A. & Consumer Defense Law Group Restore Title for Los Angeles Duplex Owner

Another Surplus Trustee Sale Reversal: N.A.C.A. & Consumer Defense Law Group Restore Title for Los Angeles Duplex Owner

LOS ANGELES, CA, UNITED STATES, February 13, 2026 /EINPresswire.com/ — In a rare legal outcome that many homeowners

February 20, 2026

NBCI Launches the New National Black Breast Cancer (NBBCF) Fund Website

NBCI Launches the New National Black Breast Cancer (NBBCF) Fund Website

The Black Church is compelled to take this critical step alongside the American Clinic Health Disparities Commission.

February 20, 2026