- Yaro on AI and Tech Trends
- Posts
- š£This Is How AI Is Rewriting the Rules of Cyber Warfare
š£This Is How AI Is Rewriting the Rules of Cyber Warfare
What Is Reinforcement Learning as a Service?
Get in Front of 50k Tech Leaders: Grow With Us
AI is a powerful tool for innovation and, increasingly, for crime. As cyber warfare becomes more sophisticated, attacks are becoming more targeted, autonomous, and harder to stop. In this issue, we break down how AI is reshaping the ransomware threat, what RLaaS really means, and whoās leading the charge. Plus, what is āvibe workingā but will it work for you or take your work? Letās dive in and stay curious.
What Is Reinforcement Learning as a Service?
AI Tools - Reinforcement learning
Now we have Vibe working? But what is it?
AI Guides
This Is How AI Is Rewriting the Rules of Cyber Warfare
š° AI News and Trends
OpenAI takes on Google, Amazon with new agentic shopping system
California Governor Newsom signs landmark AI safety bill SB 53
Anthropic launches Claude Sonnet 4.5, its best AI model for coding
OpenAIās first-half revenue rises 16% to about $4.3 billion
Jensen Huang says China is ānanoseconds behindā the US in chipmaking, calls for reducing US export restrictions on Nvidiaās AI chips
Chinaās DeepSeek just launched V3.2-exp, an open-weight model built on a new āsparse attentionā design. By layering a ālightning indexerā with fine-grained token selection, it trims the compute load of long-context inference. Early tests claim API calls run at half the usual cost, with the weights already live on Hugging Face for third-party audits.
The creator of AI actress āTilly Norwood,ā who exploded across the internet over the weekend, has insisted she is an artwork, after a fierce backlash from the creative community.
What Is Reinforcement Learning as a Service?
Itās an emerging model where companies offer plug-and-play reinforcement learning tools that allow businesses to train AI systems on real-world behavior, not just data.
Instead of just feeding AI static documents, RL lets you train it by showing it how humans actually do tasks, like drafting contracts, processing invoices, or writing code. The AI gets rewarded for doing it right, and penalized for errors, just like training a dog, but at internet scale.
Why RLaaS Is Taking Off
Traditional AI is plateauing. Pretraining on scraped web data is no longer enough to improve performance.
Businesses want automation. RLaaS lets them train AI agents that mimic expert workflows and complete full tasks, not just generate text.
Cheaper than building in-house. RLaaS platforms provide the algorithms, infrastructure, and tooling without needing deep ML teams.
Whoās Building RLaaS?

Use Cases in the Wild
Law firms: Train AI to review and revise contracts
Finance: Automate document analysis and audit tasks
Dev teams: Use RL-trained coding agents like Devin (by Cognition AI)
Media: RL agents trained to generate and edit videosLeave a comment
What we are reading:
Career creator for those building a life without a blueprint. Every Monday morning, I send out First Things First, a weekly guide to staying present, productive, and purposeful.
Discover how to differentiate your firm. Get our ā7 Positioning Sins That Cost Consultancy Firms Millionsā guide when you join. Itās free, join 10,000+ consultancy executives
Now we have Vibe working? But what is it?

Vibe coding has taken the world by storm, and the models available are quite impressive. Anyone can seem to be able to vibe code an app into existence, and engineers are supercharging their output thanks to it. Now, Microsoft is launching a new way to work called āvibe workingā. Is anyone going to really work anymore? Powered by AI agents inside Word, Excel, and soon PowerPoint. The idea is that you donāt just use the app, you co-create with it.
Think of it like ChatGPT trained on Office and built to do the work, not just help with it. But if it does the work, are we training our digital replacements?
What Is Vibe Working?
āVibe workingā is Microsoftās term for agent-powered productivity inside Office apps. Using Agent Mode, you can:
Create reports, budgets, and presentations from a simple prompt
Iterate with Copilot like youāre having a conversation
Automate formatting, summaries, charts, and even branding
Itās a new pattern: AI doesnāt just assistāit takes initiative.
How It Works
Excel Agent Mode: Prompts like ābuild a loan calculatorā or āgenerate a budget trackerā trigger Copilot to create fully functional spreadsheets with charts, formulas, and formatting.
Word Vibe Writing: Prompt with goals (āclean this upā, āsummarize meeting notesā), and Copilot refines the doc, asks clarifying questions, and makes it share-ready.
Office Agent (Copilot Chat): Use natural language to request a presentation or documentāCopilot does the research, asks questions, and builds the file from scratch.
All of this is built using Anthropicās Claude models, not just GPT.
Why It Matters
True agentic productivity: You go from typing in a doc to delegating tasks to an AI.
Better iteration loops: You can now ask, fix, and reframe documents in one place.
Accessible automation: Vibe working simplifies complex tools like Excel for non-experts.
Who Can Use It?
Available on the web version of Word and Excel (PowerPoint coming soon)
Requires Microsoft 365 Personal, Family, or Frontier Program access
Agent Mode in Excel needs the Excel Labs add-in Yaro on AI and Tech Trends | Your Top AI Newsletter
š§° AI Tools of The Day
Reinforcement learning
1. Ray RLlib - An open-source library for scalable reinforcement learning from Anyscale. Supports distributed training and is used by companies like Amazon and Uber for custom RL workflows.
2. SageMaker RL - Amazonās fully managed service to build, train, and deploy RL models in the cloud. Supports simulators like Unity and RoboMaker for training environments.
3. Stable-Baselines3 - A lightweight Python library for building custom RL agents using proven algorithms like PPO, DQN, and A2C. Great for research and early-stage prototypes.
4. Applied Compute - RL-as-a-service startup by ex-OpenAI staffers. Helps enterprises fine-tune AI agents on legal, finance, and dev tasks using reinforcement learning. Currently in stealth but backed by Benchmark and Lux.
5. CleanRL - A minimal, single-file implementation of key RL algorithmsāperfect for understanding how RL works under the hood. Great for startups and solo devs.
Download our list of 1000+ Tools for free.
This Is How AI Is Rewriting the Rules of Cyber Warfare

Ransomware in 2025 has evolved into an AI-powered, highly adaptive threat, using polymorphic malware, deepfakes of executives, and autonomous network mapping to strike with speed and precision. No longer just about data theft, these attacks target control and systemic disruption, threatening healthcare, energy, and critical infrastructure. With quantum computing on the horizon, the risk of āharvest now, decrypt laterā makes post-quantum encryption urgent. Defenders must adopt behavioral AI, zero-trust policies, offline backups, and deepfake readiness to keep pace. This isnāt just a cyber riskāitās a strategic battlefield.Share
š§° AI Guides
Deep Reinforcement Learning
Free, open-source, beginner ā advanced track
Hands-on training with RL libraries like Stable Baselines3, CleanRL, etc.
Mixes theory and practice (algorithms, environments, agent training)
Bonus resource: OpenAI āSpinning Up in Deep RLā
Itās a practical RL primer with code, theory, and guidance for how to begin experiments.
Explore our AI Guides ā from coding to photography and beyond, find step-by-step tips to put AI to work for you.
Reply