Google claims that Bard is improving at math and programming

Bard, Google’s beleaguered AI-powered chatbot, is slowly improving at tasks involving logic and reasoning. That’s according to a blog post published today by the tech giant, which says that a technique called “implicit code execution” has made Bard better at math and coding specifically.

As the blog post explains, large language models (LLMs) such as Bard are essentially prediction engines. When given a prompt, they generate a response by anticipating what words are likely to come next in a sentence. That makes them exceptionally good email and essay writers, but somewhat error-prone software developers.
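To make the "prediction engine" idea concrete, here is a deliberately tiny sketch in Python. It is not how Bard or any production LLM works internally: it assumes a toy word-pair (bigram) counter in place of a neural network, and the names `train_bigrams` and `predict_next` and the sample corpus are all made up for illustration.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequently seen follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical training text, invented for this example.
corpus = "the cat sat on the mat and the cat slept"
model = train_bigrams(corpus)

print(predict_next(model, "the"))    # "cat" follows "the" most often
print(predict_next(model, "slept"))  # nothing ever followed "slept"
```

A predictor like this only echoes patterns it has already seen. Nothing in it performs arithmetic or runs logic, which hints at why pure next-word prediction can stumble on math and code.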

But wait, you might say — what about code-generating models like GitHub’s Copilot and Amazon’s CodeWhisperer? Well, those aren’t general-purpose. Unlike Bard and rivals along the lines of ChatGPT, which were trained using a vast range of text samples from the web, e-books and other resources, Copilot, CodeWhisperer and comparable code-generating models were trained and fine-tuned almost exclusively on code samples.

Motivated to address the coding and mathematics shortcomings in general LLMs, Google developed implicit code execution, which allows Bard to write and execute its own code. The latest version of Bard identifies prompts that might benefit from logical code, writes the code “under the hood,” tests it and uses the result to generate an ostensibly more accurate response.
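Google has not published how this works internally, so the following Python sketch only mirrors the steps as described: spot a prompt that needs computation, produce code, run it and fold the result into the reply. Everything here is an assumption made for illustration, including the function names (`extract_expression`, `answer`), the regular-expression heuristic standing in for the model's judgment, and the restriction to whole-number arithmetic.

```python
import ast
import operator
import re

# Arithmetic operators the toy evaluator is willing to run.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

FALLBACK = "(plain text answer, no code executed)"


def extract_expression(prompt):
    """Hypothetical detector: find a whole-number arithmetic expression, if any."""
    for candidate in re.findall(r"[\d\s+\-*/()]+", prompt):
        candidate = candidate.strip()
        if any(c.isdigit() for c in candidate) and any(c in "+-*/" for c in candidate):
            return candidate
    return None


def _evaluate(node):
    """Walk the parsed expression and compute it, rejecting anything unexpected."""
    if isinstance(node, ast.Expression):
        return _evaluate(node.body)
    if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        return _OPS[type(node.op)](_evaluate(node.left), _evaluate(node.right))
    if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
        return -_evaluate(node.operand)
    raise ValueError("unsupported expression")


def answer(prompt):
    """Run the 'generated code' when the prompt looks computational, else fall back."""
    expression = extract_expression(prompt)
    if expression is None:
        return FALLBACK
    try:
        value = _evaluate(ast.parse(expression, mode="eval"))
    except (SyntaxError, ValueError, ZeroDivisionError):
        # The extracted code can be wrong or unrunnable; fall back to text.
        return FALLBACK
    return f"{expression} = {value}"


print(answer("What is 12 * (3 + 4)?"))
print(answer("Write me a poem about autumn"))
```

The design point is that the number in the reply comes from executing code rather than from predicting likely digits. The fallback branch reflects the same caveat Google raises: the detection step can miss, and the code can fail.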

Based on internal benchmarking, Google says that the new Bard’s responses to “computation-based” word and math problems were improved by 30% compared to the previous Bard release. Of course, we’ll have to see whether those claims stand up to outside testing.

“Even with these improvements, Bard won’t always get it right — for example, Bard might not generate code to help the prompt response, the code it generates might be wrong or Bard may not include the executed code in its response,” Bard product lead Jack Krawczyk and VP of engineering Amarnag Subramanya wrote in the blog post. “With all that said, this improved ability to respond with structured, logic-driven capabilities is an important step toward making Bard even more helpful.”

When Google launched Bard earlier this year, it didn’t compare favorably to the likes of Bing Chat and ChatGPT. Indeed, the rollout was a bit of a disaster: a Google ad featured a wrong answer by Bard, briefly tanking the company’s stock by 8%.

Reportedly, several Google employees who tested Bard prior to its release raised serious concerns to the search giant, with one person calling it a “pathological liar” and another deeming it “worse than useless.”

With implicit code execution and other enhancements, like support for new languages, multimodal queries and image generation, Google is responding to criticism and attempting to turn the situation around.

Whether it’ll be enough to keep up with the leading generative AI chatbots in the space, though, remains to be seen. Recently, Anthropic introduced an AI chatbot model with a greatly expanded “context window,” which allows the model to converse relatively coherently for hours or even days as opposed to minutes. And OpenAI, the developer behind ChatGPT, has begun supporting plugins that supercharge ChatGPT with outside knowledge and skills.
