DMR News

Advancing Digital Conversations

Transforming Enterprise AI: Writer’s Latest Model Deciphers Text from Images

By Huey Yee Ong

Feb 29, 2024

Writer, a forward-thinking startup based in San Francisco, has been making strides in developing AI writing solutions tailored for the enterprise sector. The company recently announced an innovative addition to its suite of products: Palmyra-Vision, a feature of its Palmyra model capable of generating text from visual inputs such as graphs and charts.

May Habib, co-founder and CEO of Writer, emphasizes the company’s commitment to multimodal content creation. By focusing on transforming images into text outputs, Writer aims to harness the full potential of generative AI for business applications. “Our goal is to concentrate on producing textual outputs from multimodal inputs, thereby delivering insights and generated text,” Habib explained in a conversation with TechCrunch.

Innovating Text Generation from Images

Palmyra-Vision analyzes images to extract and generate text. For now, the work centers on interpreting visual content rather than producing it. Creating charts and graphs from data is a possibility for the future, but Writer’s current focus is on text generation from existing images.

To do this, Palmyra-Vision uses multiple models, each assigned a specific role in the process. The approach is meant to ensure that the content of an image is identified accurately before text is generated from it. Habib says the multi-model system achieves four nines (99.99%) of accuracy.
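To put the "four nines" figure in concrete terms, here is a minimal sketch of the arithmetic in Python. It assumes the accuracy is measured per processed item, which the article does not specify. The function names and the example volumes are illustrative and do not come from Writer.

```python
def accuracy_from_nines(nines: int) -> float:
    """Convert a count of nines into an accuracy fraction (4 -> 0.9999)."""
    return 1 - 1 / 10 ** nines


def expected_errors(items: int, nines: int) -> float:
    """Expected number of wrong outputs among `items` results.

    Assumption: accuracy is per item, so the error rate is 1 in 10**nines.
    """
    return items / 10 ** nines


if __name__ == "__main__":
    # Hypothetical workloads, chosen only to illustrate the scale.
    for volume in (10_000, 1_000_000):
        print(f"{volume:>9,} items -> about {expected_errors(volume, 4):.0f} expected error(s)")
```

Under that assumption, four nines works out to roughly one incorrect output per 10,000 items, so errors still occur at large volumes.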

How Can Businesses Benefit from Writer’s Palmyra-Vision?

The applications of Palmyra-Vision span a wide range of business needs.

  • E-commerce Platforms:
    • Automatically updates product descriptions by generating text from new images of merchandise.
    • Eliminates the need for manual updates whenever the inventory changes.
  • Data Analysis:
    • Automatically interprets and summarizes the key points from charts and graphs.
    • Streamlines the analysis of complex data for quicker decision-making.
  • Regulatory Compliance:
    • Performs automated FDA compliance checks for pharmaceutical advertising.
    • Ensures ad content adheres to FDA regulations without human intervention.
  • Handwritten Note Conversion:
    • Converts handwritten notes into digital text.
    • Requires sector-specific training of the model to reach high accuracy in fields like healthcare and insurance.

The Crucial Role of Human Oversight

Despite the advanced capabilities of Palmyra-Vision, Habib advises against relying solely on AI tools without human oversight. The possibility of AI models generating inaccurate information or “hallucinating” data underscores the importance of incorporating human review into the workflow. While most of Writer’s clients understand and implement this practice, Habib acknowledges the need for more streamlined, automated workflows to ensure consistent quality control across all users.

Writer has raised $126 million to date, including a $100 million Series B funding round led by Iconiq in September 2023. As it continues to grow, the company is exploring partnerships with major cloud infrastructure providers to scale its operations and expand its reach in the enterprise AI sector.

With Palmyra-Vision now available, Writer sets a new benchmark in the integration of visual and textual AI capabilities, offering businesses unprecedented tools to harness the power of generative AI.

