How do I build my own language model? Fine-tuning a Large Language Model (LLM)

by Helle Hannken-Illjes on May 24, 2024

Large Language Models (LLM), such as those behind ChatGPT, are transforming technical communication. There are many use cases for AI, from text generation to voice assistants. However, despite their power, generic LLMs sometimes do not provide the right solution for a company's specific projects. Generic AIs have a lot of general knowledge and can handle many languages and general texts – but they often struggle with requests from specific domains.

Is it possible to train a generative AI in a corporate language and to train it with the knowledge of the company?

We have trained and fine-tuned an LLM with technical documentation in two languages and examined the effects. In our presentation, we show the selection criteria for an LLM, the process of fine-tuning the LLM, and the results of the company-specific LLM.

What you will learn:

  • What are Large Language Models?
  • Use of language models in technical communication
  • Fine-tuning a large language model: process and results
  • Outlook for productive use in technical communication

The video is a recording of the presentation given by Helle Hannken-Illjes and Ulrike Parson at the tekom annual conference 2023. The presentation is in German.

How do I build my own language model? Fine-tuning a Large Language Model

We advise on the use of artificial intelligence, for example through the use of language models such as ChatGPT, in technical communication and for the various target-group- oriented channels. Find out more!

Add new comment

Your email address will not be published.

You might also be interested in

White paper “Artificial Intelligence in technical communication: application, opportunities, risks, and how it will change our profession”

by on February 07, 2024

Artificial Intelligence (AI) is revolutionizing the world, including the world of technical communication. Never has it been so easy to automatically create or edit text and images with the help of generative AI such as ChatGPT. What does this mean for the field of technical communication and the technical writing community? What are the opportunities and risks? more...

tcworld highlights: the top 5 of Mette, Helle, Lukas and Ozan

by Mette Lilienthal , Helle Hannken-Illjes , Lukas Jetzig , Ozan Yıldırır on December 01, 2023

For the first time, our colleagues Mette Lilienthal, Helle Hannken-Illjes, Lukas Jetzig and Ozan Yıldırır attended the tcworld conference in person. They joined panel discussions and presentations, and some of them were also speakers themselves. We wanted to know what they found particularly interesting as tcworld newcomers and which were their top 5 contributions. more...

parson and LangTec announce AI cooperation

November 02, 2023

The use of artificial intelligence is becoming increasingly important in technical communication. In order to offer our customers the best possible solutions for AI-based text and language technology applications, parson has started a cooperation with the Hamburg-based technology provider LangTec. more...