AI Infrastructure · Platform Scale · Cloud Communications
AI Operations: When your platform hits its inflection point, operations become the product.
Most cloud and telco platforms don't fail because of bad engineering. They fail operationally when the systems built for 50 customers break at 500. I help you build the infrastructure, processes, and AI-driven systems to stay ahead of it.
2M+
users scaled on a global cloud calling platform
100+
carrier partners across 60+ markets
45%
reduction in incident analysis time
100×
better AI outcomes, published research
25+
years in telecom and cloud operations
How we work together
Three engagements. One outcome:
a platform that scales.
Every engagement is scoped to where you are. Whether you need a clear-eyed assessment, a hands-on program build, or a senior operational leader embedded in your team, the work is grounded in the same research-backed methodology.
Advisory Sprint · 3–6 weeks
AIOps Readiness Assessment
Your ops model worked at launch. The question is whether it can hold at the next order of magnitude. This engagement evaluates your current operational maturity, maps the gaps where multi-agent AI can have the highest impact, and delivers a prioritized roadmap you can execute, with or without continued engagement.
READINESS SCORECARD · PRIORITIZED ACTION PLAN · EXECUTIVE BRIEFING
Program Build · 8–16 weeks
Carrier & Partner Operations Program
Building a carrier or enterprise partner ecosystem is a different kind of scaling problem. Partner onboarding, SLA governance, joint incident resolution, compliance across markets. The operational architecture has to be right from the start or it becomes the ceiling on your growth. I design and build this infrastructure hands-on, based on the model I built at Microsoft scaling a cloud calling platform across 100+ carrier partners.
OPERATING MODEL · SLA FRAMEWORKS · RUNBOOKS · OBSERVABILITY TOOLCHAIN
Fractional Leadership · Retained
Fractional VP Operations / CTO
For companies that need senior operational leadership without the timeline or cost of a full-time executive hire. I work embedded with your team at 2–3 days per week, covering platform scale strategy, AI-driven operations, SRE organization design, and executive stakeholder alignment. You get direct access to someone who has operated at the scale you are building toward.
STRATEGIC OPS LEADERSHIP · SRE ORG DESIGN · AIOPS STRATEGY · BOARD-READY REPORTING
About
I've built these programs from the inside.
Over 25 years in telecom and cloud, I've led some of the most operationally complex infrastructure programs in the industry, at companies where the cost of getting it wrong was measured in millions of dollars and millions of users.
At Microsoft, as Principal Group Technical Program Manager, I led the global TPM and SRE organizations responsible for cloud calling services across Teams, Skype, and Azure, reaching more than 60 markets worldwide. The work I'm most proud of is the kind that's invisible when it's done right: the operational infrastructure that meant millions of users could make a call at 2am across 100 + countries and nothing broke.
At Ericsson, I served as Vice President of Program Management, leading a globally distributed R&D organization of 200+ engineers across Asia, Europe, and North America. I oversaw the delivery of transformative programs for Tier1 operators such as digital modernization, billing, charging, IoT services, and one of the first nationwide NB-IoT network rollouts in the US.
I'm also a published researcher. My paper on multi-agent AI orchestration, available on arXiv, demonstrates that coordinated AI agents achieve 100% actionable recommendation rates versus 1.7% for single-agent systems. That research is the foundation of MyAntFarm.ai, an open-source AIOps framework I'm now bringing into live telco and cloud environments.
I work with a small number of clients at a time. When I take on an engagement, you get me , not a team I manage from a distance.
Not advised on them.
Not studied them.
Built them.
"Great connectivity is more than uptime. It is trust."
Research & proof
Methodology you can cite. Outcomes you can verify.
Most consultants bring a framework they built in PowerPoint. The methodology behind this work is peer-reviewed, reproducible, and open-source. You can read the paper, run the framework, and verify the results yourself.
Platform scale
2M+ users · 100+ carriers
Scaled Microsoft's cloud calling platform from zero to over two million active users across more than 100 carrier partners in 60+ global markets while maintaining 99.999% availability throughout.
Operational impact
45% reduction in incident analysis time
Lowered incident analysis time through the adoption of a Unified Observability Platform that simplifies diagnosis and triage across carrier partners and reducing customer-reported incidents by 30%.
Client Testimonials
Peter Nilsson
Engineering Leader at Microsoft
"If you're looking for someone who blends technical depth with exceptional people skills to drive results, Philip is the one."
Charbel Azzi
Technical Product Owner at Telstra
"His reliability, professionalism, and willingness to go above and beyond made meaningful impact on our work."
Charilaos Christopoulos
CTO & VP at Ericsson
"Philip is customer obsessed, being it internal or external customers. He drives his tasks with energy and passion - balancing product quality with cost and efficiencies."