Get GenAI guide

Access HaxiTAG GenAI research content, trends and predictions.

Monday, August 11, 2025

Goldman Sachs Leads the Scaled Deployment of AI Software Engineer Devin: A Milestone in Agentic AI Adoption in Banking

August 11, 2025

In the context of the banking sector’s transformation through digitization, cloud-native technologies, and the emergence of intelligent systems, Goldman Sachs has become the first major bank to pilot AI software engineers at scale. This initiative is not only a forward-looking technological experiment but also a strategic bet on the future of hybrid workforce models. The developments and industry signals highlighted herein are of milestone significance and merit close attention from enterprise decision-makers and technology strategists.

Devin and the Agentic AI Paradigm: A Shift in Banking Technology Productivity

Devin, developed by Cognition AI, is rooted in the Agentic AI paradigm, which emphasizes autonomy, adaptivity, and end-to-end task execution. Unlike conventional AI assistance tools, Agentic AI exhibits the following core attributes:

Autonomous task planning and execution: Devin goes beyond code generation; it can deconstruct goals, orchestrate resources, and iteratively refine outcomes, significantly improving closed-loop task efficiency.
High adaptivity: It swiftly adapts to complex fintech environments, integrating seamlessly with diverse application stacks such as Python microservices, Kubernetes clusters, and data pipelines.
Continuous learning: By collaborating with human engineers, Devin continually enhances code quality and delivery cadence, building organizational knowledge over time.

According to IT Home and Sina Finance, Goldman Sachs has initially deployed hundreds of Devin instances and plans to scale this to thousands in the coming years. This level of deployment signals a fundamental reconfiguration of the bank’s core IT capabilities.

Insight: The integration of Devin is not merely a cost-efficiency play—it is a commercial validation of end-to-end intelligence in financial software engineering and indicates that the AI development platform is becoming a foundational infrastructure in the tech strategies of leading banks.

Cognition AI’s Vertical Integration: Building a Closed-Loop AI Engineer Ecosystem

Cognition AI has reached a valuation of $4 billion within two years, supported by notable venture capital firms such as Founders Fund and 8VC, reflecting strong capital market confidence in the Agentic AI track. Notably, its recent acquisition of AI startup Windsurf has further strengthened its AI engineering ecosystem:

Windsurf specializes in low-latency inference frameworks and intelligent scheduling layers, addressing performance bottlenecks in multi-agent distributed execution.
The acquisition enables deep integration of model inference, knowledge base management, and project delivery platforms, forming a more comprehensive enterprise-grade AI development toolchain.

This vertical integration and platformization offer compelling value to clients in banking, insurance, and other highly regulated sectors by mitigating pilot risks, simplifying compliance processes, and laying a robust foundation for scaled, production-grade deployment.

Structural Impact on Banking Workforce and Human Capital

According to projections by Sina Finance and OFweek, AI—particularly Agentic AI—will impact approximately 200,000 technical and operational roles in global banking over the next 3–5 years. Key trends include:

Job transformation: Routine development, scripting, and process integration roles will shift towards collaborative "human-AI co-creation" models.
Skill upgrading: Human engineers must evolve from coding executors to agents' orchestrators, quality controllers, and business abstraction experts.
Diversified labor models: Reliance on outsourced contracts will decline as internal AI development queues and flexible labor pools grow.

Goldman Sachs' adoption of a “human-AI hybrid workforce” is not just a technical pilot but a strategic rehearsal for future organizational productivity paradigms.

Strategic Outlook: The AI-Driven Leap in Financial IT Production

Goldman’s deployment of Devin represents a paradigm leap in IT productivity—centered on the triad of productivity, compliance, and agility. Lessons for other financial institutions and large enterprises include:

Strategic dimension: AI software engineering must be positioned as a core productive force, not merely a support function.
Governance dimension: Proactive planning for agent governance, compliance auditing, and ethical risk management is essential to avoid data leakage and accountability issues.
Cultural dimension: Enterprises must nurture a culture of “human-AI collaboration” to promote knowledge sharing and continuous learning.

As an Agentic AI-enabled software engineer, Devin has demonstrated its ability to operate autonomously and handle complex tasks in mission-critical banking domains such as trading, risk management, and compliance. Each domain presents both transformative value and governance challenges, summarized below.

Value Analysis: Trading — Enhancing Efficiency and Strategy Innovation

Automated strategy generation and validation
Devin autonomously handles data acquisition, strategy development, backtesting, and risk exposure analysis—accelerating the strategy iteration lifecycle.
Support for high-frequency, event-driven development
Built for microservice architectures, Devin enables rapid development of APIs, order routing logic, and Kafka-based message buses—ideal for low-latency, high-throughput trading systems.
Cross-asset strategy integration
Devin unifies modeling across assets (e.g., FX, commodities, interest rates), allowing standardized packaging and reuse of strategy modules across markets.

Value Analysis: Risk Management — Automated Modeling and Proactive Alerts

Automated risk model construction and tuning
Devin builds and optimizes models such as credit scoring, liquidity stress testing, and VaR systems, adapting features and parameters as needed.
End-to-end risk analysis platform development
From ETL pipelines to model deployment and dashboarding, Devin automates the full stack, enhancing responsiveness and accuracy.
Flexible scenario simulation
Devin simulates asset behavior under various stressors—market shocks, geopolitical events, climate risks—empowering data-driven executive decisions.

Value Analysis: Compliance — Workflow Redesign and Audit Enhancement

Smart monitoring and rule engine configuration
Devin builds automated rule engines for AML, KYC, and trade surveillance, enabling real-time anomaly detection and intervention.
Automated compliance report generation
Devin aggregates multi-source data to generate tailored regulatory reports (e.g., Basel III, SOX, FATCA), reducing manual workload and error rates.
Cross-jurisdictional regulation mapping and updates
Devin continuously monitors global regulatory changes and alerts compliance teams while building a dynamic regulatory knowledge graph.

Governance Mechanisms and Collaboration Frameworks in Devin Deployment

Strategic Element	Recommended Practice
Agent Governance	Assign human supervisors to each Devin instance, establishing accountability and oversight.
Change Auditing	Implement behavior logging and traceability for every decision point in the agent's workflow.
Human-AI Workflow	Embed Devin into a “recommendation-first, human-final” pipeline with manual sign-off at critical checkpoints.
Model Evaluation	Continuously monitor performance using PR curves, stability indices, and drift detection for ongoing calibration.

Devin’s application across trading, risk, and compliance showcases its capacity to drive automation, elevate productivity, and enable strategic innovation. However, deploying Agentic AI in finance demands rigorous governance, strong explainability, and clearly delineated human-AI responsibilities to balance innovation with accountability.

From an industry perspective, Cognition AI’s capital formation, product integration, and ecosystem positioning signal the evolution of AI engineering into a highly integrated, standardized, and trusted infrastructure. Devin may just be the beginning.

Final Insight: Goldman Sachs’ deployment of Devin represents the first systemic validation of Agentic AI at commercial scale. It underscores how banking is prioritizing technological leadership and hybrid workforce strategies in the next productivity revolution. As industry pilots proliferate, AI engineers will reshape enterprise software delivery and redefine the human capital landscape.

The Strategic Shift of Generative AI in the Enterprise: From Adoption Surge to Systemic Evolution

August 01, 2025

Bain & Company’s report, “Despite Barriers, the Adoption of Generative AI Reaches an All-Time High”, provides an authoritative and structured exploration of the strategic significance, systemic challenges, and capability-building imperatives of generative AI (GenAI) in enterprise services. It offers valuable insights for senior executives and technical leaders seeking to understand the business impact and organizational implications of GenAI deployment.

Generative AI at Scale: A Technological Leap Triggering Organizational Paradigm Shifts

According to Bain’s 2025 survey, 95% of U.S. enterprises have adopted generative AI, with production use cases increasing by 101% year-over-year. This leap signals not only technological maturity but a foundational shift in enterprise operating models—GenAI is no longer a peripheral innovation but a core driver reshaping workflows, customer engagement, and product development.

The IT function has emerged as the fastest adopter, integrating GenAI into modules such as code generation, knowledge retrieval, and system operations—demonstrating the technology’s natural alignment with knowledge-intensive tasks. Initially deployed to enhance operational efficiency and reduce costs, GenAI is now evolving from a productivity enhancer into a value creation engine as enterprises deepen its application.

Strategic Prioritization: Evolving Enterprise Mindsets and Readiness Gaps

Notably, the share of companies prioritizing AI as a strategic initiative has risen to 15% within a year, and 50% now have a defined implementation roadmap. This trend indicates a shift among leading firms from a narrow focus on deployment to building comprehensive AI governance frameworks—encompassing platform architecture, talent models, data assets, and process redesign.

However, the report also reveals a significant bifurcation: half of all companies still lack a clear strategy. This reflects an emerging “capability polarization” in the market. Front-runners are institutionalizing GenAI through standardized workflows, mature governance, and deep vendor partnerships, while others remain stuck in fragmented pilots without coherent organizational frameworks.

Realizing Value: A Reinforcing Feedback Loop of Performance and Confidence

Over 80% of reported use cases met or exceeded expectations, and nearly 60% of satisfied enterprises reported measurable business improvements—affirming the commercial viability of GenAI. These high-yield use cases—document generation, customer inquiry automation, internal search, reporting—share common traits: high knowledge structure, task repeatability, and stable context.

More importantly, this success has triggered a confidence flywheel: early wins → increased executive trust → expanded resource allocation → greater capabilities. Among organizations that have scaled GenAI, approximately 90% report target attainment or outperformance—highlighting the compounding marginal value of GenAI as it evolves from a tactical tool to a strategic platform.

Structural Challenges: Beyond Technical Hurdles to Organizational Complexity

Despite steep adoption curves, enterprises face three core, systemic constraints that must be addressed:

Data Security and Governance: As GenAI embeds itself deeper into critical systems, issues such as compliance, access control, and context integrity become paramount. Late-stage adopters are particularly focused on data lifecycle integrity and output accountability—underscoring the growing sensitivity to AI-related risk externalities.
Talent Gaps and Knowledge Asymmetries: 75% of companies report an inability to find internal expertise in critical functions. This is less about a shortage of AI engineers, and more about the lack of organizational infrastructure to integrate business users with AI systems—via interfaces, training, and process alignment.
Vendor Fragmentation and Ecosystem Fragility: With rapid evolution in AI infrastructure and models, long-term stability remains elusive. Concerns about vendor quality and model maintainability are surging among advanced adopters—reflecting increased strategic dependence on reliable ecosystem partners.

Reconstructing the Investment Rhythm: From Exploration Budgets to Operational Expenditures

Enterprise GenAI investment is entering a phase of structural normalization. Since early 2024, average annual AI budgets have reached $10 million—up 102% year-over-year. More significantly, 60% of GenAI projects are now funded through standard operating budgets, signaling a shift from experimental spending to institutionalized resource allocation.

This transition reflects a change in organizational perception: GenAI is no longer a one-off innovation initiative, but a core pillar within digital architecture, talent strategy, and process transformation. Enterprises are integrating GenAI into AI governance hubs and scenario-driven microservice deployments, emphasizing long-term, scalable orchestration.

Strategic Insight: GenAI as a Competitive Operating System of the Future

The central insight from Bain’s research is clear: generative AI is not just about technical deployment—it demands a fundamental redesign of organizational capabilities and cognitive infrastructure. Companies that sustainably unlock value from GenAI exhibit four shared traits:

Clear prioritization of high-value GenAI scenarios across the enterprise;
A cross-functional AI operations hub to align data, processes, models, and personnel;
A layered AI talent architecture—including prompt engineers, data governance experts, and domain modelers;
Integration of GenAI into core governance systems such as budgeting, KPIs, compliance, ethics, and knowledge management.

In the coming years, enterprise competition will no longer hinge on whether GenAI is adopted, but on how effectively organizations rewire their business models, restructure internal systems, and build defensible, sustainable AI capabilities. GenAI will become a benchmark for digital maturity—and a decisive differentiator in asymmetric competition.

Conclusion

Bain’s research offers a mirror reflecting how deeply generative AI is transforming the enterprise landscape. In this era of complex technological and organizational convergence, companies must look beyond tools and models. Strategic vision, systemic governance, and human-AI symbiosis are essential to unleashing the full multiplier effect of GenAI. Only with such a holistic approach can organizations seize the opportunity to lead in the next wave of digital transformation—and shape the future of business itself.

AI Automation: A Strategic Pathway to Enterprise Intelligence in the Era of Task Reconfiguration

July 13, 2025

With the rapid advancement of generative AI and task-level automation, the impact of AI on the labor market has gone far beyond the simplistic notion of "job replacement." It has entered a deeper paradigm of task reconfiguration and value redistribution. This transformation not only reshapes job design but also profoundly reconstructs organizational structures, capability boundaries, and competitive strategies. For enterprises seeking intelligent transformation and enhanced service and competitiveness, understanding and proactively embracing this change is no longer optional—it is a strategic imperative.

The "Dual Pathways" of AI Automation: Structural Transformation of Jobs and Skills

AI automation is reshaping workforce structures along two main pathways:

Routine Automation (e.g., customer service responses, schedule planning, data entry): By replacing predictable, rule-based tasks, automation significantly reduces labor demand and improves operational efficiency. A clear outcome is the decline in job quantity and the rise in skill thresholds. For instance, British Telecom’s plan to cut 40% of its workforce and Amazon’s robot fleet surpassing its human workforce exemplify enterprises adjusting the human-machine ratio to meet cost and service response imperatives.
Complex Task Automation (e.g., roles involving analysis, judgment, or interaction): Automation decomposes knowledge-intensive tasks into standardized, modular components, expanding employment access while lowering average wages. Job roles like telephone operators or rideshare drivers are emblematic of this "commoditization of skills." Research by MIT reveals that a one standard deviation drop in task specialization correlates with an 18% wage decrease—even as employment in such roles doubles, illustrating the tension between scaling and value compression.

For enterprises, this necessitates a shift from role-centric to task-centric job design, and a comprehensive recalibration of workforce value assessment and incentive systems.

Task Reconfiguration as the Engine of Organizational Intelligence: Not Replacement, but Reinvention

When implementing AI automation, businesses must discard the narrow view of “human replacement” and adopt a systems approach to task reengineering. The core question is not who will be replaced, but rather:

Which tasks can be automated?
Which tasks require human oversight?
Which tasks demand collaborative human-AI execution?

By clearly classifying task types and redistributing responsibilities accordingly, enterprises can evolve into truly human-machine complementary organizations. This facilitates the emergence of a barbell-shaped workforce structure: on one end, highly skilled "super-individuals" with AI mastery and problem-solving capabilities; on the other, low-barrier task performers organized via platform-based models (e.g., AI operators, data labelers, model validators).

Strategic Recommendations:

Accelerate automation of procedural roles to enhance service responsiveness and cost control.
Reconstruct complex roles through AI-augmented collaboration, freeing up human creativity and judgment.
Shift organizational design upstream, reshaping job archetypes and career development around “task reengineering + capability migration.”

Redistribution of Competitive Advantage: Platform and Infrastructure Players Reshape the Value Chain

AI automation is not just restructuring internal operations—it is redefining the industry value chain.

Platform enterprises (e.g., recruitment or remote service platforms) have inherent advantages in standardizing tasks and matching supply with demand, giving them control over resource allocation.
AI infrastructure providers (e.g., model developers, compute platforms) build strategic moats in algorithms, data, and ecosystems, exerting capability lock-in effects downstream.

To remain competitive, enterprises must actively embed themselves within the AI ecosystem, establishing an integrated “technology–business–talent” feedback loop. The future of competition lies not between individual companies, but among ecosystems.

Societal and Ethical Considerations: A New Dimension of Corporate Responsibility

AI automation exacerbates skill stratification and income inequality, particularly in low-skill labor markets, where “new structural unemployment” is emerging. Enterprises that benefit from AI efficiency gains must also fulfill corresponding responsibilities:

Support workforce skill transition through internal learning platforms and dual-capability development (“AI literacy + domain expertise”).
Participate in public governance by collaborating with governments and educational institutions to promote lifelong learning and career retraining systems.
Advance AI ethics governance to ensure fairness, transparency, and accountability in deployment, mitigating hidden risks such as algorithmic bias and data discrimination.

AI Is Not Destiny, but a Matter of Strategic Choice

As one industry mentor aptly stated, “AI is not fate—it is choice.” How a company defines which tasks are delegated to AI essentially determines its service model, organizational form, and value positioning. The future will not be defined by “AI replacing humans,” but rather by “humans redefining themselves through AI.”

Only by proactively adapting and continuously evolving can enterprises secure their strategic advantage in this era of intelligent reconfiguration.

Insights & Commentary: AI-Driven Personalized Marketing — Paradigm Shift from Technical Frontier to Growth Core

July 30, 2025

In the wave of digital transformation, personalized marketing has evolved from a “nice-to-have” tactic to a central engine driving enterprise growth and customer loyalty. McKinsey’s report “The New Frontier of Personalization” underscores this shift and systematically highlights how Artificial Intelligence (AI), especially Generative AI (Gen AI), has become the catalytic force behind this revolution.

Key Insight

We are at a pivotal inflection point — enterprises must view AI-driven personalization not as a mere technology upgrade or marketing tool, but as a strategic investment to rebuild customer relationships, optimize business outcomes, and construct enduring competitive advantages. This necessitates a fundamental overhaul of technology stacks, organizational capabilities, and operational philosophies.

Strategic Perspective: Bridging the Personalization Gap through AI

McKinsey’s data sharply reveals a core contradiction in the market: 71% of consumers expect personalized interactions, yet 76% feel frustrated when this expectation isn’t met. This gap stems from the limitations of traditional marketing — reliant on manual efforts, fragmented processes, and a structural conflict between scale and personalization.

The emergence of AI, particularly Gen AI, offers a historic opportunity to bridge this fundamental gap.

From Broad Segmentation to Precision Targeting

Traditional marketing depends on coarse demographic segmentation. In contrast, AI leverages deep learning models to analyze vast, multi-dimensional first-party data in real time, enabling precise intent prediction at the individual level. This shift empowers businesses to move beyond static lifecycle management towards dynamic, propensity-based decision-making — such as predicting the likelihood of a user responding to a specific promotion — thereby enabling optimal allocation of marketing resources.

From Content Bottlenecks to Creative Explosion

Content is the vehicle of personalization, but conventional content production is the primary bottleneck of marketing automation. Gen AI breaks this constraint, enabling the automated generation of hyper-personalized copy, images, and even videos around templated narratives — at speeds tens of times faster than traditional methods. This is not only an efficiency leap, but a revolution in scalable creativity, allowing brands to “tell a unique story to every user.”

Execution Blueprint: Five Pillars of Next-Generation Intelligent Marketing

McKinsey outlines five pillars — Data, Decisioning, Design, Distribution, and Measurement — to build a modern personalization architecture. For successful implementation, enterprises should focus on the following key actions:

Data: Treat customer data as a strategic asset, not an IT cost. The foundation is a unified, clean, and real-time accessible Customer Data Platform (CDP), integrating touchpoint data from both online and offline interactions to construct a 360-degree customer view — fueling AI model training and inference.

Decisioning: Build an AI-powered “marketing brain.” Enterprises should invest in intelligent engines that integrate predictive models (e.g., purchase propensity, churn risk) with business rules, dynamically optimizing the best content, channel, and timing for each customer — shifting from human-driven to algorithm-driven decisions.

Design: Embed Gen AI into the creative supply chain. This requires embedding Gen AI tools into the content lifecycle — from ideation and compliance to version iteration — and close collaboration between marketing and technical teams to co-develop tailored models that align with brand values.

Distribution: Enable seamless, real-time omnichannel execution. Marketing instructions generated by the decisioning engine must be precisely deployed via automated distribution systems across email, apps, social media, physical stores, etc., ensuring consistent experience and real-time responsiveness.

Measurement: Establish a responsive, closed-loop attribution and optimization system. Marketing impact must be validated through rigorous A/B testing and incrementality measurement. Feedback loops should inform decision engines to drive continuous strategy refinement.

Closed-Loop Automation and Continuous Optimization

From data acquisition and model training to content production, campaign deployment, and impact evaluation, enterprises must build an end-to-end automated workflow. Cross-functional teams (marketing, tech, compliance, operations) should operate in agile iterations, using A/B tests and multivariate experiments to achieve continuous performance enhancement.

Technical Stack and Strategic Gains

By applying data-driven customer segmentation and behavioral prediction, enterprises can tailor incentive strategies across customer lifecycle stages (acquisition, retention, repurchase, cross-sell) and campaign objectives (branding, promotions), and deliver them consistently across multiple channels (web, app, email, SMS). This can lead to a 1–2% increase in sales and a 1–3% gain in profit margins — anchored on a “always-on” intelligent decision engine capable of real-time optimization.

Marketing Technology Framework by McKinsey

Data: Curate structured metadata and feature repositories around campaign and content domains.
Decisioning: Build interpretable models for promotional propensity and content responsiveness.
Design: Generate and manage creative variants via Gen AI workflows.
Distribution: Integrate DAM systems with automated campaign pipelines.
Measurement: Implement real-time dashboards tracking impact by channel and creative.

Gen AI can automate creative production for targeted segments with ~50x efficiency, while feedback loops continuously fine-tune model outputs.

However, most companies remain in manual pilot stages, lacking true end-to-end automation. To overcome this, quality control and compliance checks must be embedded in content models to eliminate hallucinations and bias while aligning with brand and legal standards.

Authoritative Commentary: Challenges and Outlook

In today’s digital economy, consumer demand for personalized engagement is surging: 71% expect it, 76% are disappointed when unmet, and 65% cite precision promotions as a key buying motivator.

Traditional mass, manual, and siloed marketing approaches can no longer satisfy this diversity of needs or ensure sustainable ROI. Yet, the shift to AI-driven personalization is fraught with challenges:

Three Core Challenges for Enterprises

Organizational and Talent Transformation: The biggest roadblock isn’t technology, but organizational inertia. Firms must break down silos across marketing, sales, IT, and data science, and nurture hybrid talent with both technical and business acumen.
Technological Integration Complexity: End-to-end automation demands deep integration of CDP, AI platforms, content management, and marketing automation tools — placing high demands on enterprise architecture and system integration capabilities.
Balancing Trust and Ethics: Where are the limits of personalization? Data privacy and algorithmic ethics are critical. Mishandling user data or deploying biased models can irreparably damage brand trust. Transparent, explainable, and fair AI governance is essential.

Conclusion

AI and Gen AI are ushering in a new era of precision marketing — transforming it from an “art” to an “exact science.” Those enterprises that lead the charge in upgrading their technology, organizational design, and strategic thinking — and successfully build an intelligent, closed-loop marketing system — will gain decisive market advantages and achieve sustainable, high-quality growth. This is not just the future of marketing, but a necessary pathway for enterprises to thrive in the digital economy.

Developing LLM-based GenAI Applications: Addressing Four Key Challenges to Overcome Limitations
Analysis of AI Applications in the Financial Services Industry
Application of HaxiTAG AI in Anti-Money Laundering (AML)
Analysis of HaxiTAG Studio's KYT Technical Solution
Strategies and Challenges in AI and ESG Reporting for Enterprises: A Case Study of HaxiTAG
HaxiTAG ESG Solutions: Best Practices Guide for ESG Reporting
Impact of Data Privacy and Compliance on HaxiTAG ESG System

Generative AI as a "Cybernetic Teammate": Deep Insights into a New Paradigm of Team Collaboration

July 23, 2025

Case Overview and Thematic Innovation

This case is based on the study “The Cybernetic Teammate: A Field Experiment on Generative AI Reshaping Teamwork and Expertise”, which explores the multifaceted impact of generative AI on team collaboration, knowledge sharing, and emotional experiences in enterprise-level new product development. Drawing from a sample of 776 professionals at Procter & Gamble, the study employed a 2×2 randomized controlled trial, comparing individual versus team work with and without AI assistance. Findings reveal that individuals using GPT-4-based generative AI matched or exceeded the performance of traditional two-person teams, demonstrating marked advantages in innovation output, cross-disciplinary integration, and emotional motivation.

Key Innovations in the Study:

Redefining Team Structures: AI evolves from a mere auxiliary tool to a “cybernetic teammate,” gradually replacing certain collaborative functions within real-world team settings.
Cross-Disciplinary Knowledge Integration: Generative AI effectively bridges gaps between domains—such as business and technology or R&D and marketing—enabling individuals with non-specialist backgrounds to produce high-quality solutions with both technical and commercial value.
Emotional and Social Support: Beyond information and decision-making assistance, AI interactions resembling human conversation were found to uplift participants’ emotional states, enhancing job satisfaction and team cohesion.

Application Scenarios and Effectiveness

Practical Use Cases

New Product Development & Innovation: In consumer goods companies like P&G, new product development relies heavily on cross-functional collaboration. This study showcases AI’s potential in ideating, evaluating, and optimizing product solutions in real business contexts.
Cross-Functional Collaboration: Traditionally, communication gaps exist between business experts and R&D specialists due to differing priorities. The integration of generative AI helped bridge these divides, enabling more balanced and comprehensive solutions.
Skill Acceleration and Agile Execution: With just one hour of AI training, participants quickly mastered tool usage and completed tasks faster than traditional teams, saving approximately 12%–16% of work time.

Performance and Utility

Productivity Gains: Data indicate that individuals using AI alone achieved performance levels comparable to traditional teams, with a performance improvement of 0.37 standard deviations. AI-assisted teams performed slightly better, suggesting AI's capacity to replicate team synergy in the short term.
Enhanced Innovation: Solutions created with AI showed significant improvements in creativity and completeness. Notably, the probability of AI-assisted teams producing top 10% solutions increased by 9.2 percentage points over non-AI teams, underscoring AI’s capacity to stimulate breakthrough thinking.
Emotional and Social Experience: AI users reported higher levels of excitement, energy, and satisfaction, while anxiety and frustration were notably reduced. This affirms AI’s positive role in emotional support and psychological motivation.

Strategic Implications and Intelligent Transformation

Rethinking Team Composition and Organizational Design

The Rise of the “Cybernetic Teammate”: Generative AI is shifting from a passive tool to an active team member. Organizations can leverage AI to streamline team structures, optimize resource allocation, and enhance collaborative efficiency.
Catalyst for Cross-Departmental Integration: AI facilitates deeper interaction and knowledge sharing across formerly siloed departments, enabling multidimensional innovation. Enterprises should consider building AI-assisted, cross-functional work models to unleash internal potential.

Enhancing Decision-Making and Innovation Capacity

Intelligent Decision Support: By delivering real-time, multi-perspective insights on complex problems, generative AI enables employees to formulate well-rounded solutions quickly, thereby improving decision accuracy and creative outcomes.
Training and Skill Transformation: As AI tools become integral to daily work, organizations should invest in upskilling employees in AI operation and cognitive adaptation to support a smooth transition to intelligent workflows and organizational capability upgrades.

Long-Term Vision and Strategic Planning

Harnessing Human-AI Synergy: While current findings reflect short-term impacts, the long-term potential of AI will grow with user proficiency and system evolution. Future research should examine AI’s enduring role in fostering innovation, professional development, and shaping corporate culture.
Building Trust and Emotional Connection: The success of AI integration depends not only on efficiency gains but also on cultivating trust and emotional affinity. Designing more human-centric, interactive AI systems can help organizations build workplaces that are both productive and emotionally supportive.

Conclusion

This case offers valuable empirical insights into the application of generative AI in enterprise settings, demonstrating its critical role in enhancing productivity, fostering cross-departmental collaboration, and enriching emotional experiences at work. As technology evolves and workforce capabilities improve, generative AI is poised to become a driving force for intelligent enterprise transformation and collaborative optimization. When shaping future work models, organizations must prioritize not only the efficiency brought by technological empowerment but also the cultivation of trust and emotional synergy in human-AI collaboration, to truly realize a digital and intelligent future.

OpenAI’s Seven Key Lessons and Case Studies in Enterprise AI Adoption

July 18, 2025

AI is Transforming How Enterprises Work

OpenAI recently released a comprehensive guide on enterprise AI deployment, openai-ai-in-the-enterprise.pdf, based on firsthand experiences from its research, application, and deployment teams. It identified three core areas where AI is already delivering substantial and measurable improvements for organizations:

Enhancing Employee Performance: Empowering employees to deliver higher-quality output in less time
Automating Routine Operations: Freeing employees from repetitive tasks so they can focus on higher-value work
Enabling Product Innovation: Delivering more relevant and responsive customer experiences

However, AI implementation differs fundamentally from traditional software development or cloud deployment. The most successful organizations treat AI as a new paradigm, adopting an experimental and iterative approach that accelerates value creation and drives faster user and stakeholder adoption.

OpenAI’s integrated approach — combining foundational research, applied model development, and real-world deployment — follows a rapid iteration cycle. This means frequent updates, real-time feedback collection, and continuous improvements to performance and safety.

Seven Key Lessons for Enterprise AI Deployment

Lesson 1: Start with Rigorous Evaluation
Case: How Morgan Stanley Ensures Quality and Safety through Iteration

As a global leader in financial services, Morgan Stanley places relationships at the core of its business. Faced with the challenge of introducing AI into highly personalized and sensitive workflows, the company began with rigorous evaluations (evals) for every proposed use case.

Evaluation is a structured process that assesses model performance against benchmarks within specific applications. It also supports continuous process improvement, reinforced with expert feedback at each step.

In its early stages, Morgan Stanley focused on improving the efficiency and effectiveness of its financial advisors. The hypothesis was simple: if advisors could retrieve information faster and reduce time spent on repetitive tasks, they could provide more and better insights to clients.

Three initial evaluation tracks were launched:

Translation Accuracy: Measuring the quality of AI-generated translations
Summarization: Evaluating AI’s ability to condense information using metrics for accuracy, relevance, and coherence
Human Comparison: Comparing AI outputs to expert responses, scored on accuracy and relevance

Results: Today, 98% of Morgan Stanley advisors use OpenAI tools daily. Document access has increased from 20% to 80%, and search times have dropped dramatically. Advisors now spend more time on client relationships, supported by task automation and faster insights. Feedback has been overwhelmingly positive — tasks that once took days now take hours.

Lesson 2: Embed AI into Products
Case: How Indeed Humanized Job Matching

AI’s strength lies in handling vast datasets from multiple sources, enabling companies to automate repetitive work while making user experiences more relevant and personalized.

Indeed, the world’s largest job site, now uses GPT-4o mini to redefine job matching.

The “Why” Factor: Recommending good-fit jobs is just the beginning — it’s equally important to explain why a particular role is suggested.

By leveraging GPT-4o mini’s analytical and language capabilities, Indeed crafts natural-language explanations in its messages and emails to job seekers. Its popular "invite to apply" feature also explains how a candidate’s background makes them a great fit.

When tested against the prior matching engine, the GPT-powered version showed:

A 20% increase in job application starts
A 13% improvement in downstream hiring success

Given that Indeed sends over 20 million messages monthly and serves 350 million visits, these improvements translate to major business impact.

Scaling posed a challenge due to token usage. To improve efficiency, OpenAI and Indeed fine-tuned a smaller model that achieved similar results with 60% fewer tokens.

Helping candidates understand why they’re a fit for a role is a deeply human experience. With AI, Indeed is enabling more people to find the right job faster — a win for everyone.

Lesson 3: Start Early, Invest Ahead of Time
Case: Klarna’s Compounding Returns from AI Adoption

AI solutions rarely work out-of-the-box. Use cases grow in complexity and impact through iteration. Early adoption helps organizations realize compounding gains.

Klarna, a global payments and shopping platform, launched a new AI assistant to streamline customer service. Within months, the assistant handled two-thirds of all service chats — doing the work of hundreds of agents and reducing average resolution time from 11 to 2 minutes. It’s expected to drive $40 million in profit improvement, with customer satisfaction scores on par with human agents.

This wasn’t an overnight success. Klarna achieved these results through constant testing and iteration.

Today, 90% of Klarna’s employees use AI in their daily work, enabling faster internal launches and continuous customer experience improvements. By investing early and fostering broad adoption, Klarna is reaping ongoing returns across the organization.

Lesson 4: Customize and Fine-Tune Models
Case: How Lowe’s Improved Product Search

The most successful enterprises using AI are those that invest in customizing and fine-tuning models to fit their data and goals. OpenAI has invested heavily in making model customization easier — through both self-service tools and enterprise-grade support.

OpenAI partnered with Lowe’s, a Fortune 50 home improvement retailer, to improve e-commerce search accuracy and relevance. With thousands of suppliers, Lowe’s deals with inconsistent or incomplete product data.

Effective product search requires both accurate descriptions and an understanding of how shoppers search — which can vary by category. This is where fine-tuning makes a difference.

By fine-tuning OpenAI models, Lowe’s achieved:

A 20% improvement in labeling accuracy
A 60% increase in error detection

Fine-tuning allows organizations to train models on proprietary data such as product catalogs or internal FAQs, leading to:

Higher accuracy and relevance
Better understanding of domain-specific terms and user behavior
Consistent tone and voice, essential for brand experience or legal formatting
Faster output with less manual review

Lesson 5: Empower Domain Experts
Case: BBVA’s Expert-Led AI Adoption

Employees often know their problems best — making them ideal candidates to lead AI-driven solutions. Empowering domain experts can be more impactful than building generic tools.

BBVA, a global banking leader with over 125,000 employees, launched ChatGPT Enterprise across its operations. Employees were encouraged to explore their own use cases, supported by legal, compliance, and IT security teams to ensure responsible use.

“Traditionally, prototyping in companies like ours required engineering resources,” said Elena Alfaro, Global Head of AI Adoption at BBVA. “With custom GPTs, anyone can build tools to solve unique problems — getting started is easy.”

In just five months, BBVA staff created over 2,900 custom GPTs, leading to significant time savings and cross-departmental impact:

Credit risk teams: Faster, more accurate creditworthiness assessments
Legal teams: Handling 40,000+ annual policy and compliance queries
Customer service teams: Automating sentiment analysis of NPS surveys

The initiative is now expanding into marketing, risk, operations, and more — because AI was placed in the hands of people who know how to use it.

Lesson 6: Remove Developer Bottlenecks
Case: Mercado Libre Accelerates AI Development

In many organizations, developer resources are the primary bottleneck. When engineering teams are overwhelmed, innovation slows, and ideas remain stuck in backlogs.

Mercado Libre, Latin America's largest e-commerce and fintech company, partnered with OpenAI to build Verdi, a developer platform powered by GPT-4o and GPT-4o mini.

Verdi integrates language models, Python, and APIs into a scalable, unified platform where developers use natural language as the primary interface. This empowers 17,000 developers to build consistently high-quality AI applications quickly — without deep code dives. Guardrails and routing logic are built-in.

Key results include:

100x increase in cataloged products via automated listings using GPT-4o mini Vision
99% accuracy in fraud detection through daily evaluation of millions of product listings
Multilingual product descriptions adapted to regional dialects
Automated review summarization to help customers understand feedback at a glance
Personalized notifications that drive engagement and boost recommendations

Next up: using Verdi to enhance logistics, reduce delivery delays, and tackle more high-impact problems across the enterprise.

Lesson 7: Set Bold Automation Goals
Case: How OpenAI Automates Its Own Work

At OpenAI, we work alongside AI every day — constantly discovering new ways to automate our own tasks.

One challenge was our support team’s workflow: navigating systems, understanding context, crafting responses, and executing actions — all manually.

We built an internal automation platform that layers on top of existing tools, streamlining repetitive tasks and accelerating insight-to-action workflows.

First use case: Working on top of Gmail to compose responses and trigger actions. The platform pulls in relevant customer data and support knowledge, then embeds results into emails or takes actions like opening support tickets.

By integrating AI into daily workflows, the support team became more efficient, responsive, and customer-centric. The platform now handles hundreds of thousands of tasks per month — freeing teams to focus on higher-impact work.

It all began because we chose to set bold automation goals, not settle for inefficient processes.

Key Takeaways

As these OpenAI case studies show, every organization has untapped potential to use AI for better outcomes. Use cases may vary by industry, but the principles remain universal.

The Common Thread: AI deployment thrives on open, experimental thinking — grounded in rigorous evaluation and strong safety measures. The best-performing companies don’t rush to inject AI everywhere. Instead, they align on high-ROI, low-friction use cases, learn through iteration, and expand based on that learning.

The Result: Faster and more accurate workflows, more personalized customer experiences, and more meaningful work — as people focus on what humans do best.

We’re now seeing companies automate increasingly complex workflows — often with AI agents, tools, and resources working in concert to deliver impact at scale.

From Tool to Productivity Engine: Goldman Sachs' Deployment of “Devin” Marks a New Inflection Point in AI Industrialization

July 12, 2025

Goldman Sachs’ pilot deployment of Devin, an AI software engineer developed by Cognition, represents a significant signal within the fintech domain and marks a pivotal shift in generative AI’s trajectory—from a supporting innovation to a core productivity engine. Driven by increasing technical maturity and deepening industry awareness, this initiative offers three profound insights:

Human-AI Collaboration Enters a Deeper Phase

That Devin still requires human oversight underscores a key reality: current AI tools are better suited as Augmented Intelligence Partners rather than full replacements. This deployment reflects a human-centered principle of AI implementation—emphasizing enhancement and collaboration over substitution. Enterprise service providers should guide clients in designing hybrid workflows that combine “AI + Human” synergy—for example, through pair programming or human-in-the-loop code reviews—and establish evaluation metrics to monitor efficiency and risk exposure.

From General AI to Industry-Specific Integration

The financial industry, known for its data intensity, strict compliance standards, and complex operational chains, is breaking new ground by embracing AI coding tools at scale. This signals a lowering of the trust barrier for deploying generative AI in high-stakes verticals. For solution providers, this reinforces the need to shift from generic models to scenario-specific AI capability modules. Emphasis should be placed on aligning with business value chains and identifying AI enablement opportunities in structured, repeatable, and high-frequency processes. In financial software development, this means building end-to-end AI support systems—from requirements analysis to design, compliance, and delivery—rather than deploying isolated model endpoints.

Synchronizing Organizational Capability with Talent Strategy

AI’s influence on enterprises now extends well beyond technology—it is reshaping talent structures, managerial models, and knowledge operating systems. Goldman Sachs’ adoption of Devin is pushing traditional IT teams toward hybrid roles such as prompt engineers, model tuners, and software developers, demanding greater interdisciplinary collaboration and cognitive flexibility. Industry mentors should assist enterprises in building AI literacy assessment frameworks, establishing continuous learning platforms, and promoting knowledge codification through integrated data assets, code reuse, and AI toolchains—advancing organizational memory towards algorithmic intelligence.

Conclusion

Goldman Sachs’ trial of Devin is not only a forward-looking move in financial digitization but also a landmark case of generative AI transitioning from capability-driven to value-driven industrialization. For enterprise service providers and AI ecosystem stakeholders, it represents both an opportunity and a challenge. Only by anchoring to real-world scenarios, strengthening organizational capabilities, and embracing human-AI synergy as a paradigm, can enterprises actively lead in the generative AI era and build sustainable intelligent innovation systems.

AI-Driven Software Development Transformation at Rakuten with Claude Code

June 30, 2025

Rakuten has achieved a transformative overhaul of its software development process by integrating Anthropic’s Claude Code, resulting in the following significant outcomes:

Claude Code demonstrated autonomous programming for up to seven continuous hours in complex open-source refactoring tasks, achieving 99.9% numerical accuracy;
New feature delivery time was reduced from an average of 24 working days to just 5 days, cutting time-to-market by 79%;
Developer productivity increased dramatically, enabling engineers to manage multiple tasks concurrently and significantly boost output.

Case Overview, Core Concepts, and Innovation Highlights

This transformation not only elevated development efficiency but also established a pioneering model for enterprise-grade AI-driven programming.

Application Scenarios and Effectiveness Analysis

1. Team Scale and Development Environment

Rakuten operates across more than 70 business units including e-commerce, fintech, and digital content, with thousands of developers serving millions of users. Claude Code effectively addresses challenges posed by multilingual, large-scale codebases, optimizing complex enterprise-grade development environments.

2. Workflow and Task Types

Workflows were restructured around Claude Code, encompassing unit testing, API simulation, component construction, bug fixing, and automated documentation generation. New engineers were able to onboard rapidly, reducing technology transition costs.

3. Performance and Productivity Outcomes

Development Speed: Feature delivery time dropped from 24 days to just 5, representing a breakthrough in efficiency;
Code Accuracy: Complex technical tasks were completed with up to 99.9% numerical precision;
Productivity Gains: Engineers managed concurrent task streams, enabling parallel development. Core tasks were prioritized by developers while Claude handled auxiliary workstreams.

4. Quality Assurance and Team Collaboration

AI-driven code review mechanisms provided real-time feedback, improving code quality. Automated test-driven development (TDD) workflows enhanced coding practices and enforced higher quality standards across the team.

Strategic Implications and AI Adoption Advancements

From Assistive Tool to Autonomous Producer: Claude Code has evolved from a tool requiring frequent human intervention to an autonomous “programming agent” capable of sustaining long-task executions, overcoming traditional AI attention span limitations.
Building AI-Native Organizational Capabilities: Even non-technical personnel can now contribute via terminal interfaces, fostering cross-functional integration and enhancing organizational “AI maturity” through new collaborative models.
Unleashing Innovation Potential: Rakuten has scaled AI utility from small development tasks to ambient agent-level automation, executing monorepo updates and other complex engineering tasks via multi-threaded conversational interfaces.
Value-Driven Deployment Strategy: Rakuten prioritizes AI tool adoption based on value delivery speed and ROI, exemplifying rational prioritization and assurance pathways in enterprise digital transformation.

The Outlook for Intelligent Evolution

By adopting Claude Code, Rakuten has not only achieved a leap in development efficiency but also validated AI’s progression from a supportive technology to a core component of process architecture. This case highlights several strategic insights:

AI autonomy is foundational to driving both efficiency and innovation;
Process reengineering is the key to unlocking organizational potential with AI;
Cross-role collaboration fosters a new ecosystem, breaking down technical silos and making innovation velocity a sustainable competitive edge.

This case offers a replicable blueprint for enterprises across industries: by building AI-centric capability frameworks and embedding AI across processes, roles, and architectures, organizations can accumulate sustained performance advantages, experiential assets, and cultural transformation — ultimately elevating both organizational capability and business value in tandem.

Get GenAI guide

Monday, August 11, 2025

Devin and the Agentic AI Paradigm: A Shift in Banking Technology Productivity

Cognition AI’s Vertical Integration: Building a Closed-Loop AI Engineer Ecosystem

Structural Impact on Banking Workforce and Human Capital

Strategic Outlook: The AI-Driven Leap in Financial IT Production

Value Analysis: Trading — Enhancing Efficiency and Strategy Innovation

Value Analysis: Risk Management — Automated Modeling and Proactive Alerts

Value Analysis: Compliance — Workflow Redesign and Audit Enhancement

Governance Mechanisms and Collaboration Frameworks in Devin Deployment

Related Topic

Friday, August 1, 2025

Generative AI at Scale: A Technological Leap Triggering Organizational Paradigm Shifts

Strategic Prioritization: Evolving Enterprise Mindsets and Readiness Gaps

Realizing Value: A Reinforcing Feedback Loop of Performance and Confidence

Structural Challenges: Beyond Technical Hurdles to Organizational Complexity

Reconstructing the Investment Rhythm: From Exploration Budgets to Operational Expenditures

Strategic Insight: GenAI as a Competitive Operating System of the Future

The "Dual Pathways" of AI Automation: Structural Transformation of Jobs and Skills

Task Reconfiguration as the Engine of Organizational Intelligence: Not Replacement, but Reinvention

Strategic Recommendations:

Redistribution of Competitive Advantage: Platform and Infrastructure Players Reshape the Value Chain

Societal and Ethical Considerations: A New Dimension of Corporate Responsibility

AI Is Not Destiny, but a Matter of Strategic Choice

Related Topic

Wednesday, July 30, 2025

Key Insight

Strategic Perspective: Bridging the Personalization Gap through AI

From Broad Segmentation to Precision Targeting

From Content Bottlenecks to Creative Explosion

Execution Blueprint: Five Pillars of Next-Generation Intelligent Marketing

Closed-Loop Automation and Continuous Optimization

Technical Stack and Strategic Gains

Marketing Technology Framework by McKinsey

Authoritative Commentary: Challenges and Outlook

Three Core Challenges for Enterprises

Conclusion

Related topic:

Wednesday, July 23, 2025

Key Innovations in the Study:

Application Scenarios and Effectiveness

Practical Use Cases

Performance and Utility

Strategic Implications and Intelligent Transformation

Rethinking Team Composition and Organizational Design

Enhancing Decision-Making and Innovation Capacity

Long-Term Vision and Strategic Planning

Conclusion

Related topic:

Friday, July 18, 2025

Seven Key Lessons for Enterprise AI Deployment

Key Takeaways

Related topic:

Saturday, July 12, 2025

Human-AI Collaboration Enters a Deeper Phase

From General AI to Industry-Specific Integration

Synchronizing Organizational Capability with Talent Strategy

Conclusion

Related Topic

Monday, June 30, 2025

Application Scenarios and Effectiveness Analysis

1. Team Scale and Development Environment

2. Workflow and Task Types

3. Performance and Productivity Outcomes

4. Quality Assurance and Team Collaboration

Strategic Implications and AI Adoption Advancements

The Outlook for Intelligent Evolution

Related Topic

Views

Product

Labels