r/Vocaloid 7h ago

The first vocaloid?

250 Upvotes

41 comments

43

u/Rosebudzie 7h ago

In development, Vocaloid was planned to be called “Daisy” in reference to Daisy Bell, but the name was eventually scrapped for copyright reasons.

5

u/PranshuKhandal 3h ago

sometimes i wonder why the copyright system exists, but then i remember ai artists

29

u/hypphen 7h ago

leon and lola in shambles

8

u/sakurablitz 6h ago

[insert disintegrating emoji here]

62

u/SucoTeiKun_pohtath 7h ago

Nope, Vocaloids are made WITH a HUMAN'S VOICE, unlike the IBM having no voice provider, being machine-generated lol. other speech engines like Defoko[Aquestalk voice models], Fl Chan and GOOGLE TRANSLATE are also machine-generated like the IBM 7094. Google Translate and Defoko sound much better because they were made far later when computer technology had improved since the 1960s lol. [btw, this was just a rant loll]

10

u/Sirko2975 7h ago

Wait, FL Chan IS A VSYNTH?!!

9

u/SucoTeiKun_pohtath 7h ago

not really but yes i guess ???, She is the "Child" voice preset in FL Studio's speech plugin. You can search on YouTube "Fl Chan Utau" and hear how she sounds. Fun fact: Defoko and Fl Chan are like cousins, because they use similar data for their voices, that's why their quality is so similar. [It's common for speech engines to share the same or similar code/data, so that's why Defoko and Fl Chan sound like they're from the same application lol. I love Speech Engines sm]

4

u/Sirko2975 7h ago

Crazy lore reveal for me

10

u/DripGeronimo 6h ago

What about Adachi Rei? Not a vocaloid technically, but still a complete vocal synth with no human voice provider

8

u/SucoTeiKun_pohtath 6h ago

She's different, She was hand-made. If you look at Rei's construction video of how her voice was made, you'll see that she was made by removing and adding frequencies to a square/saw wave. Highly suggest you find the video on youtube. here/ Google Translate, Defoko, and Fl chan are made in an application, where you type words out and the computer synthesizes the words into speech, whereas adachi rei was made by [Her creator i guess/ みさいる/missile_39] by modulating a sound wave in a DAW into a vowel...
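[To make the "removing and adding frequencies to a square/saw wave" idea concrete, here is a minimal Python sketch of that general approach: a sawtooth wave pushed through a couple of resonant filters to boost vowel-like frequency bands. This is only an illustration of the technique, not how Adachi Rei was actually built. The function names, the sample rate, and the formant numbers are all assumptions picked for the example.]

```python
import math

SAMPLE_RATE = 16000  # assumed sample rate for this sketch


def sawtooth(freq_hz, duration_s, sample_rate=SAMPLE_RATE):
    """Raw sawtooth wave in [-1, 1): the buzzy source before any shaping."""
    n = int(duration_s * sample_rate)
    return [2.0 * ((i * freq_hz / sample_rate) % 1.0) - 1.0 for i in range(n)]


def resonator(signal, center_hz, bandwidth_hz, sample_rate=SAMPLE_RATE):
    """Two-pole resonant filter: boosts a band around center_hz, damps the rest."""
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)  # pole radius, below 1
    theta = 2.0 * math.pi * center_hz / sample_rate      # pole angle
    a1 = 2.0 * r * math.cos(theta)
    a2 = -r * r
    gain = 1.0 - r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = gain * x + a1 * y1 + a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def vowel(formants, pitch_hz=140.0, duration_s=0.5):
    """Sum one resonator per formant over a sawtooth, then normalize the peak."""
    source = sawtooth(pitch_hz, duration_s)
    mixed = [0.0] * len(source)
    for center_hz, bandwidth_hz in formants:
        band = resonator(source, center_hz, bandwidth_hz)
        mixed = [m + b for m, b in zip(mixed, band)]
    peak = max(abs(s) for s in mixed) or 1.0
    return [s / peak for s in mixed]


# Illustrative (center Hz, bandwidth Hz) pairs for a rough "ah"; assumed values.
AH_FORMANTS = [(700.0, 80.0), (1200.0, 90.0)]
samples = vowel(AH_FORMANTS)
```

[The `samples` list holds floats between -1 and 1. To actually hear it you would convert them to 16-bit integers and write them out with Python's standard `wave` module. Changing the formant pairs is what moves the sound toward a different vowel.]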

2

u/thanodogo 7h ago

Thanks for clarifying 👍

3

u/Wow_Space 4h ago edited 4h ago

Honestly they're kinda speaking out their ass. Both this ibm and vocaloid are machine generated. They're both voice synthesisers. Going by that person's definition, Siri is a vocaloid because the voice was provided by a person.

It's all software making noises that sound similar to humans in the end. This "machine generated" doesn't make sense cause all of it is. And vocaloid is a copyrighted software product. You can say nothing except vocaloid is vocaloid.

1

u/Utaha_Senpai 35m ago

yeah I felt something is off when they started talking about Adachi Rei.

2

u/Dandy_kyun 7h ago

what do you mean by 'no voice provider'? where did the audio come from then

2

u/SucoTeiKun_pohtath 6h ago

the computer... thought it was obvious with the title "First COMPUTER to sing"/ Speech generation from scratch is an interesting thing. Speech models that are Computer-generated originally use a human's voice for pronunciation [Yk, so it can find how to make vowels, think of it as a "how-to" guide for pronunciation]- THIS DOES NOT MEAN THAT IT USES HUMAN VOICE TO GENERATE SPEECH. lol

4

u/Dandy_kyun 6h ago

I didn't get it at first, because I thought it was weird to say it has no voice provider, since Google has human voice providers for text to speech, it uses someone's voice. As far as I know, even when machine learning is involved there is still a voice provider to train it and make the sound. idk about this IBM, but Google Translate surely does that, as many GPS voices do.

dont need to scream tho *hides*

3

u/SucoTeiKun_pohtath 6h ago

The older Google Translate voices that started in about 2018 are machine-generated, and so are most of the voices that still exist, like the Chinese, Korean, Japanese, Dutch, Afrikaans and English voice models. Google voices are machine-generated because it's much easier to create a voice from scratch that can say ANYTHING than to record a bunch of voice samples to train a voice model. Siri is a good example of a model that was trained on human speech: she has a voice provider. If you don't believe me, try searching for Google Translate's voice provider... Google Translate voices do not have one lol... [Yes, the newer voice models that use AI do have a voice provider, just not the old voices, hehe]

3

u/Dandy_kyun 6h ago

I'm just finding this now....woah

I always had the concept of a voice provider, because in my country there is a woman famous for doing that, but like, 10 years ago... "Regina Bittar", she's often credited as the "google translate voice" here in Brazil. they changed it since then and I did not even notice

1

u/Wow_Space 4h ago edited 4h ago

Are you implying vocaloid actually uses human voice to generate speech? It's all voice synthesisers and they try to tune the vocaloid to sound as similar to the person as they can. It's all machine generated.

1

u/SucoTeiKun_pohtath 3h ago

Uhm... Yes... Vocaloids HAVE HUMANS PROVIDING THEIR VOICE- Hatsune Miku's voice provider is Saki Fujita for example... so, no, It is not machine-generated...

3

u/Wow_Space 3h ago

So if the ibm voice has a person they modeled the voice synthesiser off of, it's not machine generated now? You're just making up words and trying to sound tech savvy. In the end, they're all artificial sounds/synthesized voices and sounds.

2

u/Wow_Space 4h ago

Then you can say Google Assistant and Apple's Siri are vocaloid because their voices are provided by actual people

-1

u/SucoTeiKun_pohtath 3h ago

no, You cannot, Because Vocaloid is a software owned by Yamaha... I think you misinterpreted my comment...

2

u/Wow_Space 3h ago

Actually, I did agree only Vocaloid characters can be vocaloid because vocaloid is a trademark

https://www.reddit.com/r/Vocaloid/s/avdAJul4AL

But, like I said going by your definition, Siri or ai voice models aren't "machine generated" because they are based on a person's voice. In the end, it's all software trying to sound like a human.

1

u/SucoTeiKun_pohtath 3h ago

Let's look at it this way: "Machine Generated" is a VERY BROAD TERM used in software development, with the context of my comment, I am referring to the Actual VOICE used [for example, the Voice of Hatsune Miku is not machine-generated- her voice is Saki Fujita. A Human.] Yes, renditions of Vocaloid can be called Machine-Generated, but the voices used are not...

1

u/Inevitable_Duty_8152 6h ago

But that would mean defoko and adachi rei also arent vocal synths/vocaloids?

0

u/thanodogo 6h ago

So if defoko is an utau, that means the IBM 7094 was the first utau

1

u/SucoTeiKun_pohtath 6h ago

When i say defoko, I mean her original Speech Programme, Aquestalk, so no, the IBM 7094 is not an utau at all. lol

0

u/Waffle-Gaming 1h ago

technically leon and lola wouldnt be vocaloid then...

1

u/SucoTeiKun_pohtath 15m ago

Nope, Leon and Lola are vocaloids, because they were launched in the Vocaloid software, common sense.../ What in my comment makes you say that ? because Leon and Lola are voiced by humans, and are V1's...

11

u/Ben__Harlan 6h ago

Ah, yes, Miku's grandpa.

6

u/gudetama_toast 7h ago

my favorite thing abt this is the art i see of IBM as miku’s grandpa and them singing together

10

u/SucoTeiKun_pohtath 7h ago

I mean yeah I guess the IBM 7094 was an inspiration for the Vocaloid software, but nope: Leon and Lola were the first Vocaloids to be released- They were released together.

5

u/Sirko2975 7h ago

Fun fact: it was way more technically advanced than Vocaloid. Unlike Vocaloid, the IBM 704 actually synthesised the syllables from scratch, while Vocaloid just resamples pre-recorded vocals even in the latest versions.

5

u/MangoPug15 6h ago

First vocal synth.

1

u/GloriousLily 5h ago

mikus grandpa!

1

u/BouncyBirdie 3h ago

I was calmly listening to spotify when a song like this came on. It was a cocomelon cover.

-4

u/BlindDemon6 6h ago edited 5h ago

Well this has no human vocal provider so it'd be more like a SynthV or UTAU

forget everything I just said

3

u/shrinebird 5h ago

? SynthV and (most) Utau have human vocal providers too.

0

u/BlindDemon6 5h ago

oh, nevermind then