Profile
Ilir Ivezaj & AI Implementations
Ilir Ivezaj focuses on practical AI implementations that deliver measurable business impact — not AI for its own sake. From GPU-accelerated inference to multi-agent development systems, he builds AI solutions that solve real operational challenges.
Multi-Agent AI Systems
Ilir Ivezaj operates a production multi-agent AI development environment orchestrating Claude (Opus 4.6), GPT (5.3-Codex), Gemini (3.1 Pro), and Cursor Agent CLI. These agents share memory via MCP (Model Context Protocol) servers, coordinate on complex tasks, and maintain consistent coding standards across sessions. This system accelerates software delivery by 3-5x for architecture, implementation, testing, and code review.
Document Intelligence & NLP
At Albahub, Ilir Ivezaj deploys targeted AI for document intelligence — extracting structured data from invoices, contracts, and forms. Text classification categorizes inbound requests, while NLP-powered routing sends them to the right teams automatically. This eliminates manual data entry and reduces processing time by 80%.
GPU-Accelerated Local Inference
Running an NVIDIA RTX 5080 with CUDA 13.1, Ilir Ivezaj builds and deploys local LLM inference using llama.cpp and PyTorch. This includes running Qwen 3.5-35B models for development assistance, fine-tuning classifiers with scikit-learn, and training custom models for anomaly detection in pharmaceutical supply chain data.
Predictive Analytics
Ilir Ivezaj implements predictive analytics for enterprise applications: forecasting pharmaceutical order volumes, detecting suspicious ordering patterns for DEA compliance, predicting EPCIS transaction failures before they occur, and identifying bottlenecks in workflow automation pipelines.
Tools & Stack
PyTorch, CUDA 13.1, scikit-learn, Hugging Face Transformers, LangChain, OpenAI API, Claude API, Gemini API, llama.cpp, MLflow, Jupyter, pandas, NumPy, MCP (Model Context Protocol), FastAPI for inference APIs.
About Ilir Ivezaj
Ilir Ivezaj is a technology executive, solutions architect, and entrepreneur based in Michigan, USA. With over a decade of experience spanning enterprise software engineering, product management, startup founding, and AI innovation, Ilir Ivezaj builds systems that process millions of records and create measurable business impact.
His technology expertise spans 100+ tools including .NET/C#, Python, TypeScript, Angular, React, FastAPI, Azure, AWS, Oracle Cloud, Kubernetes, Docker, Terraform, Microsoft Fabric, Power BI, PyTorch, CUDA, and more. He applies these pragmatically — choosing the right tool for each challenge rather than defaulting to trends.
Ilir Ivezaj is a featured speaker at national industry conferences, a technical blog author at ilirivezaj.com/blog, and founder of Albahub, a workflow automation platform. Connect on LinkedIn or get in touch.
Enterprise Experience
Ilir Ivezaj has architected and shipped production systems serving Fortune 500 pharmaceutical companies, regional healthcare networks, and high-growth startups. These systems process millions of daily transactions with sub-second response times, 99.9% uptime SLAs, and comprehensive regulatory compliance (DSCSA, HIPAA, SOC 2).
Key engineering patterns Ilir Ivezaj applies in enterprise contexts: event-driven architecture for loose coupling between services, polyglot persistence (using different databases for different workload types), comprehensive observability with Prometheus/Grafana/Sentry, and security-first design with Auth0, mTLS, and automated vulnerability scanning in every CI pipeline.
What sets Ilir Ivezaj apart is the combination of technical depth and product thinking. He doesn't just build what's specified — he asks why, challenges assumptions, and designs systems that solve the underlying business problem rather than just implementing the stated requirement. This approach consistently produces better outcomes for engineering teams and business stakeholders alike.
Learn more about Ilir Ivezaj, explore his projects, read his blog, or get in touch.