Question: ChatGPT API-assisted machine translation with Python. Looking for opinions.

Discussion in 'Translator's Corner' started by antimuon792, Mar 26, 2023.


What's your opinion on the quality?

  1. Excellent

    5.6%
  2. Good enough

    44.4%
  3. Meh

    27.8%
  4. Bad

    11.1%
  5. I wouldn't read this even if it was the only text remaining in the entire world.

    11.1%
  1. antimuon792

    antimuon792 Member

    Joined:
    Mar 23, 2023
    Messages:
    10
    Likes Received:
    2
    Reading List:
    Link
    Heya! I wanted to test ChatGPT's capabilities as a dedicated JP-to-EN translation tool for web novels, so I made a small Python program leveraging the API.
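
    At its core the script just wraps a chat completion call. Here's a minimal sketch, assuming the openai Python package's ChatCompletion interface as it existed at the time (0.27.x); the prompt wording and settings are illustrative, not the exact ones my script uses:

    Code:
    import openai

    openai.api_key = "YOUR_API_KEY"  # placeholder

    def translate_text(japanese_text: str) -> str:
        # One chat completion request per piece of text to translate.
        response = openai.ChatCompletion.create(
            model="gpt-3.5-turbo-0301",
            messages=[
                {"role": "system",
                 "content": "You are a translator. Translate the user's "
                            "Japanese web novel text into natural English."},
                {"role": "user", "content": japanese_text},
            ],
            temperature=0.3,  # keep the output fairly literal
        )
        return response["choices"][0]["message"]["content"]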

    I tested my script using the second chapter from the novel どうも、前世で殺戮の魔道具を作っていた子爵令嬢です by 優木凛々

    Original web novel chapter (Syosetu): n5177ic 2
    ChatGPT 3.5 JP-to-EN translation (pastebin): u1uuYT9p

    It feels good, but my knowledge of Japanese only goes as far as katakana and hiragana. I can't actually understand the source, so verifying the translation is impossible for me.

    Oh, and about cost: this was 17,999 tokens, so close to 0.04 USD using the gpt-3.5-turbo-0301 model.
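
    The math, assuming gpt-3.5-turbo's published rate at the time of 0.002 USD per 1,000 tokens (prompt and completion combined):

    Code:
    tokens_used = 17_999
    price_per_1k_usd = 0.002  # published gpt-3.5-turbo rate at the time
    print(round(tokens_used / 1000 * price_per_1k_usd, 3))  # 0.036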

    Thank you for your input!

    P.S.: Sorry about the inconvenient link format. This is my first post, so I can't post links.
     
  2. Ca5e

    Ca5e Well-Known Member

    Joined:
    Aug 10, 2022
    Messages:
    157
    Likes Received:
    186
    Reading List:
    Link
    ChatGPT is basically Google Translate that also fixes the grammar. It's good enough after some human editing.
     
  3. antimuon792

    antimuon792 Member

    Joined:
    Mar 23, 2023
    Messages:
    10
    Likes Received:
    2
    Reading List:
    Link
    Yup! It seems to be more aware of context too. There was no human editing involved in creating the uploaded text, and it doesn't feel as unnatural as Google Translate. There are still some things that feel wrong, but they are few and far between in my opinion.

    I've also tried to implement a cheat sheet of previously translated common terms (proper nouns and the like) to retain consistency across multiple chapters, roughly along the lines of the sketch below.
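
    The glossary entries here are made up for illustration; the real ones get collected from chapters already translated:

    Code:
    def build_system_prompt(glossary: dict[str, str]) -> str:
        # Inject previously decided translations so recurring terms
        # stay consistent from chapter to chapter.
        terms = "\n".join(f"{jp} -> {en}" for jp, en in glossary.items())
        return (
            "Translate the user's Japanese web novel text into English.\n"
            "Always translate these terms as shown:\n" + terms
        )

    # Example entries (made up):
    glossary = {"お兄ちゃん": "onii-chan", "魔道具": "magic tool"}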

    Can't wait to get access to the GPT-4 model API. Preliminary tests show it's better at translating.
     
    Last edited: Mar 26, 2023
  4. UnGrave

    UnGrave ななひ~^^

    Joined:
    Jun 27, 2016
    Messages:
    4,076
    Likes Received:
    12,856
    Reading List:
    Link
    I've thrown the chapter into GPT-4 if you want to look at the results.

    Note: I also requested that it "Translate the following chapter from a japanese webnovel. Make sure to maintain common web novel translation formatting conventions, such as leaving お兄ちゃん as onii-chan."

    This may have resulted in some weird side effects, like line 65 in your pastebin being broken up into tiny chunks.
     
    Last edited: Mar 27, 2023
  5. antimuon792

    antimuon792 Member

    Joined:
    Mar 23, 2023
    Messages:
    10
    Likes Received:
    2
    Reading List:
    Link
    Nice! It certainly looks better than the one produced by GPT-3.5.

    I've been working around token limits in prompts with 3.5. If you send an entire chapter like the one I used for testing, it exceeds the limit. And if you split each chapter into chunks, the model starts making continuity mistakes, like translating the same term differently or changing the subject of a sentence at the start of each chunk. The same thing also happens when trying to keep continuity across chapters, even with GPT-4.

    That's why I'm working on making the Python script handle the context of each chunk. Then, by sending a fitting prompt, you can guide the model into translating a chunk of text while keeping the surrounding context in mind.

    My script provides the model with (see the rough sketch after this list):
    • a cheat sheet of contextually meaningful, previously translated terms, generated after translating the previous chunk
    • the last line of the previous contiguous chunk
    • the first line of the next contiguous chunk
    • and a concise, context-relevant summary of the story up to the previous chunk
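
    Roughly, the per-chunk translation prompt gets assembled like this (variable names and wording are placeholders, not the script's actual prompt):

    Code:
    def build_translation_prompt(chunk, prev_last_line, next_first_line,
                                 summary, cheat_sheet):
        # cheat_sheet: {japanese_term: english_translation}
        terms = "\n".join(f"{jp} -> {en}" for jp, en in cheat_sheet.items())
        return (
            f"Story so far:\n{summary}\n\n"
            f"Translate these recurring terms consistently:\n{terms}\n\n"
            f"The previous chunk ended with:\n{prev_last_line}\n"
            f"The next chunk starts with:\n{next_first_line}\n\n"
            f"Translate the following Japanese text into English:\n{chunk}"
        )
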
    After each translation, a new cheat sheet is created for the current chunk. Each term gets a weight computed as a weighted sum over 7 different dimensions: chunk frequency, document frequency, previous-chunk frequency, summary consistency, context relevance, NER, and novelty.

    Before sending it to the API, the cheat sheet is sorted by weight and capped at 15 terms to keep prompts as small as possible.
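
    The scoring boils down to something like this (the per-dimension weights and how each dimension is scored are placeholders, not my actual values):

    Code:
    # Placeholder weights for the 7 dimensions mentioned above.
    WEIGHTS = {
        "chunk_freq": 1.0, "doc_freq": 0.5, "prev_chunk_freq": 0.5,
        "summary_consistency": 1.0, "context_relevance": 1.0,
        "ner": 1.5, "novelty": 0.5,
    }

    def term_weight(scores: dict[str, float]) -> float:
        # scores holds one normalized value per dimension for a term.
        return sum(WEIGHTS[dim] * scores.get(dim, 0.0) for dim in WEIGHTS)

    def cap_cheat_sheet(term_scores: dict[str, dict[str, float]], cap: int = 15):
        # Keep only the highest-weighted terms for the next prompt.
        ranked = sorted(term_scores, key=lambda t: term_weight(term_scores[t]),
                        reverse=True)
        return ranked[:cap]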

    I do three prompts per chunk: one for translation using the previously mentioned info; one for summary creation using the previous summary and the current translated chunk; and one more for cheat sheet creation using the current chunk.
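
    As a loose sketch of that loop, with placeholder helpers standing in for the three API calls (they are not the script's real function names):

    Code:
    # Each of these placeholders is one ChatCompletion request in practice.
    def translate_with_context(chunk, **context): ...
    def update_summary(previous_summary, translation): ...
    def extract_cheat_sheet(chunk, translation): ...

    def process_chunk(chunk, state):
        # Call 1: translate using the accumulated context.
        translation = translate_with_context(
            chunk,
            cheat_sheet=state["cheat_sheet"],
            summary=state["summary"],
            prev_last_line=state["prev_last_line"],
            next_first_line=state["next_first_line"],
        )
        # Call 2: roll the running summary forward.
        state["summary"] = update_summary(state["summary"], translation)
        # Call 3: build the cheat sheet for the next chunk.
        state["cheat_sheet"] = extract_cheat_sheet(chunk, translation)
        return translation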

    Preliminary testing shows that this works better, but I switched models to GPT-4 midway through writing the program, so my current results are better from the moment the API responds anyway.

    I also attempted to implement a way to fix paragraphs and line breaks, based on counting sentences per paragraph in the original text. It kinda works? It's a little too aggressive and almost ends up with one sentence per paragraph.
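
    Something in this direction, heavily simplified (the sentence splitting here is deliberately naive):

    Code:
    import re

    def reflow(original_jp: str, translated_en: str) -> str:
        # Count sentences per paragraph in the Japanese source...
        jp_counts = [max(p.count("。"), 1)
                     for p in original_jp.split("\n") if p.strip()]
        # ...then regroup the translated sentences to match those counts.
        en_sentences = re.split(r"(?<=[.!?])\s+", translated_en.strip())
        paragraphs, i = [], 0
        for count in jp_counts:
            paragraphs.append(" ".join(en_sentences[i:i + count]))
            i += count
        if i < len(en_sentences):  # leftovers go into a final paragraph
            paragraphs.append(" ".join(en_sentences[i:]))
        return "\n\n".join(p for p in paragraphs if p)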

    I'll switch back to GPT-3.5 and do more testing later. GPT-4 is awesome at what it can do, but it's 15 times as expensive.

    EDIT: Oh! I forgot to add: chapter splitting is done by token count using tiktoken, with 1000 tokens per chunk for GPT-3.5 and 3000 for GPT-4.
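
    A rough sketch of that splitting; how the real script handles paragraph boundaries may differ:

    Code:
    import tiktoken

    def split_chapter(text: str, max_tokens: int = 1000,
                      model: str = "gpt-3.5-turbo") -> list[str]:
        enc = tiktoken.encoding_for_model(model)
        chunks, current, current_tokens = [], [], 0
        for paragraph in text.split("\n"):
            n = len(enc.encode(paragraph))
            # Start a new chunk once the token budget would be exceeded.
            if current and current_tokens + n > max_tokens:
                chunks.append("\n".join(current))
                current, current_tokens = [], 0
            current.append(paragraph)
            current_tokens += n
        if current:
            chunks.append("\n".join(current))
        return chunks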
     
    Last edited: Mar 28, 2023
  6. antimuon792

    antimuon792 Member

    Joined:
    Mar 23, 2023
    Messages:
    10
    Likes Received:
    2
    Reading List:
    Link
    I dunno if there's a rule against double posting. First time I've used a forum since 2014 (reddit doesn't count). Please let me know if there is one.

    I added some functionality to the program. Now it gets the novel from Syosetu, translates it, and finally converts it to both an epub file and a markdown file. All it needs is a novel code and it starts working.
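
    For what it's worth, the epub step can be done with the ebooklib package; this is just one way to do it, not necessarily how my script does it, and the chapter data here is a placeholder list of (title, text) pairs:

    Code:
    from ebooklib import epub

    def chapters_to_epub(path: str, title: str,
                         chapters: list[tuple[str, str]]) -> None:
        book = epub.EpubBook()
        book.set_identifier(title)
        book.set_title(title)
        book.set_language("en")
        items = []
        for i, (ch_title, ch_text) in enumerate(chapters, 1):
            c = epub.EpubHtml(title=ch_title, file_name=f"chap_{i}.xhtml", lang="en")
            paragraphs = "".join(f"<p>{p}</p>" for p in ch_text.split("\n\n"))
            c.content = f"<h1>{ch_title}</h1>{paragraphs}"
            book.add_item(c)
            items.append(c)
        book.toc = items
        book.add_item(epub.EpubNcx())
        book.add_item(epub.EpubNav())
        book.spine = ["nav"] + items
        epub.write_epub(path, book)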

    I'll get it into a github repo and post it later today.
     
    Last edited: Mar 30, 2023
  7. 9790712701

    9790712701 New Member

    Joined:
    Apr 1, 2023
    Messages:
    1
    Likes Received:
    0
    Reading List:
    Link
    Could you please share your GitHub repository with me? I'd like to discuss some topics related to translating light novels using GPT-4, such as better prompt suggestions, how to divide the novel into smaller sections for input into GPT-4, and providing context throughout the process.
     
  8. antimuon792

    antimuon792 Member

    Joined:
    Mar 23, 2023
    Messages:
    10
    Likes Received:
    2
    Reading List:
    Link
    This is the repo where I'm working on the program: https://github.com/CombobulativeDesigns/gptwntranslator

    The config file gives you a little freedom over which models to use for which actions, so you can play around a little there.

    Documentation is hit or miss right now. I'm currently working on improving that and the test coverage.

    Later comes iterating on the prompts and translating the novel's metadata.

    You are more than welcome to refine the prompts if you can.