Tired of OpenAI's limitations for private data and eager to experiment with RAG on my own terms, I dove headfirst into a holiday quest: building a local, OpenAI-free RAG application. While countless tutorials guide Full Stack development, the "AI" magic often relies on OpenAI APIs, leaving private data concerns unresolved. So, fueled by frustration and holiday spirit, I embarked on ...
In today's data-driven landscape, enterprises are increasingly seeking to leverage the power of artificial intelligence (AI) to unlock new insights and automate tasks. However, commercial SaaS AI models often struggle to handle the specific data (usually hidden behind firewalls) and nuances of large organizations. This is where retrieval-augmented generation (RAG) comes in. RAG is a powerful technique that augments ...
Mistral AI just announced Mixtral, a new open-source AI model - https://mistral.ai/news/mixtral-of-experts. Mistral AI holds a special place for me due to its Apache 2 open-source license. It truly embodies the spirit of "Open" AI. What is Mixtral?Mixtral is a new open-source (Apache 2) AI model that is based on the mixture-of-experts (MoE) architecture. MoE models are a type of ...
Having implemented a large scale Serverless integration middleware platform on AWS for a client, I thought of writing this blog post to note how serverless best practices can help organizations improve their DevOps Research and Assessment (DORA) metrics. DORA metrics are a set of four key metrics that measure the performance of software delivery teams:Deployment frequency: How often does the ...
Generative AI, with its incredible ability to create text, code, images, and music, has become a powerful tool across various industries. However, a growing concern exists surrounding "hallucinations," where AI models generate inaccurate, misleading, or outright false outputs. This phenomenon poses significant risks, from spreading misinformation to undermining the credibility of AI-generated content. What do the experts say?The reason ...
The rise of artificial intelligence (AI) has transformed many industries, and the field of software development is no exception. AI-powered code assistants like Duet AI from Google, GitHub Copilot from Microsoft, and CodeWhisperer from Amazon Web Services (AWS) are changing the way developers work by providing intelligent suggestions, automating repetitive tasks, and helping them write better code faster.While these three ...
Two days ago, Google unveiled Gemini, a new AI model that is capable of understanding and generating text, code, audio, images, and video. Although I wouldn't categorise Gemini as Artificial General Intelligence (AGI), it is a significant step towards that eventuality.A Technical Report is available at https://storage.googleapis.com/deepmind-media/gemini/gemini_1_report.pdfThere is no doubt that Gemini is a powerful tool. I might even argue ...
Step into the captivating realm of Large Language Models (LLMs) with this must-watch video featuring the brilliant Andrej Karpathy. In this video, Karpathy unfolds the intricacies of LLMs, offering a fascinating glimpse into their training methodologies, capabilities, and the thrilling promises they bring to the table. Imagine a world where artificial intelligence seamlessly generates text, translates languages, crafts creative content, ...
During his keynote at AWS re:Invent 2023, Dr. Werner Vogels discussed several crucial considerations for architects designing distributed systems in today's cloud-native era. These seven laws encompass cost optimization, resilience, profiling, application risk categorization, and observability—factors most of us inevitably take into account when crafting solutions for our customers. Notably, this was the first instance I encountered where these principles ...
“An overnight success is ten years in the making.” ― Tom Clancy, Dead or Alive----I haven't written a long form blog post in a while. So I thought I'd post an update here on what I've been working on during my absence. Hope you enjoy the read!I picked this title for a good reason too. Sometimes, for the casual onlooker, ...
This parable is an old one. I told this last week to one of my team members at work and wanted to find the original. Since this is an old parable, I found different versions of this story over the Internet. In some versions the story revolves around a broken down ship, while in others a large machine in a ...
Note: This isn’t a Bitcoin hedging post. In fact, that was the last occurrence of that word in this article. Is the Blockchain hype over?According to this year’s Gartner Hype Cycle, Blockchain has crossed the peak of unrealistic expectations and is now heading towards the trough of disillusionment. This usually means that we’ll stop hearing about Blockchain being the cure ...
What?I have a few pet coding projects in Github. I usually maintain them as private repositories. The reason I keep them private is because they are Proof of Concept type work (and are also skunk-work).One such project I started a few months back was an IoT prototype. What I wanted to develop was an application that qualifies as an IoT ...
Image Credit: http://continuousdelivery.comThe principles of Continuous Delivery and DevOps have been around for a few years. Developers and system administrators who follow the lean-startup movement are more than familiar with both. However, more often than not, implementing either or both within a traditional, large IT environment is a significant challenge compared to a new age, Web 2.0 type organization (think ...
I love it when things 'just work' and the osx-gcc-installer is a nice, all in one package that will install make and other GNU build essentials for your Mac without having to install XCode. The pre-built binaries are available for both Snow Leopard (OSX 1.6.x) and Lion (OSX 1.7.x). So this is great if you have the older version of ...
Work-In-Play Limits in Agile Software Development | All About AgileSo let’s say you set a WIP limit that no more than 3 features can be in play at any one time. You have 3 slots on the board for development, and 3 slots for testing. What happens when the testing slots are all full and the developers have capacity to ...
So I'm on Google+, the latest social addition to the set of Google products. The immediate impression is that it is a Facebook clone.However, once I started using the set of features available (at this time of invitation only beta), I noticed a few improvements over Facebook. This is in addition to the look and feel of the site, which ...
"The JavaScript age is about event streams. Modern web pages are not pages, they are event-driven applications through which information moves. The core content vessel of the web — the document object model — still exists, but not as HTML markup. The DOM is an in-memory, efficiently-encoded data structure generated by JavaScript.LAMP architectures are dead because few web applications want ...
"The JavaScript age is about event streams. Modern web pages are not pages, they are event-driven applications through which information moves. The core content vessel of the web — the document object model — still exists, but not as HTML markup. The DOM is an in-memory, efficiently-encoded data structure generated by JavaScript.LAMP architectures are dead because few web applications want ...
"HTML5 reflects the monumental changes in the way you now do business on the web and in the cloud. This article is the first in a four-part series designed to spotlight changes in HTML5, beginning with the new tags and page organization and providing high-level information on web page design, the creation of forms, the use and value of the ...