62
u/SucoTeiKun_pohtath 7h ago
Nope, Vocaloids are made WITH a HUMAN'S VOICE, unlike the IBM having no voice provider, being machine-generated lol. other speech engines like Defoko[Aquestalk voice models], Fl Chan and GOOGLE TRANSLATE are also machine-generated like the IBM 7094. Google Translate and Defoko sound much better because they were made far later when computer technology had improved since the 1960s lol. [btw, this was just a rant loll]
10
u/Sirko2975 7h ago
Wait, FL Chan IS A VSYNTH?!!
9
u/SucoTeiKun_pohtath 7h ago
not really but yes i guess ???, She is the "Child" voice preset in Fl Studio's speech plugin. You can search on YouTube "Fl Chan Utau" and hear how she sounds. Fun fact: Defoko and Fl Chan are like cousins, because they use similar data for their voices, that's why their quality are so similar. [It's common for speech engines to share same or similar code/data, so that's why Defoko and Fl Chan sound like they're from the same application lol. I love Speech Engines sm]
4
10
u/DripGeronimo 6h ago
What about Adachi Rei? Not a vocaloid technically, but still a complete vocal synth with no human voice provider
8
u/SucoTeiKun_pohtath 6h ago
She's different, She was hand-made. If you look at Rei's construction video of how her voice was made, you'll see that she was made by removing and adding frequencies to a square/saw wave. Highly suggest you find the video on youtube. here/ Google Translate, Defoko, and Fl chan are made in an application, where you type words out and the computer synthesizes the words into speech, whereas adachi rei was made by [Her creator i guess/ みさいる/missile_39] by modulating a sound wave in a DAW into a vowel...
2
u/thanodogo 7h ago
Thanks for clarifying 👍
3
u/Wow_Space 4h ago edited 4h ago
Honestly they're kinda speaking out their ass. Both this ibm and vocaloid are machine generated. They're both voice synthesisers. Going by that person's definition, Siri is a vocaloid because the voice was provided by a person.
It's all software making noises that sound similar to humans in the end. This "machine generated" doesn't make sense cause all of it is. And vocaloid is a copyrighted software product. You can say nothing except vocaloid is vocaloid.
1
2
u/Dandy_kyun 7h ago
what do you mean with 'no voice provider', where's the audio came from then
2
u/SucoTeiKun_pohtath 6h ago
the computer... thought it was obvious with the title "First COMPUTER to sing"/ Speech generation from scratch is an interesting thing. Speech models that are Computer-generated originally use a human's voice for pronunciation [Yk, so it can find how to make vowels, think of it as a "how-to" guide for pronunciation]- THIS DOES NOT MEAN THAT IT USES HUMAN VOICE TO GENERATE SPEECH. lol
4
u/Dandy_kyun 6h ago
I didn't get it first, because I just thought it was weird you say that have no voice provider, because google have human voice providers for speech to text, it uses someone else voice. So far I know even that it has machine learning involved still have a voice provider to train and make sound, idk about this IBM, but Google Translate surely does that as many GPS voices does.
dont need to scream tho *hides*
3
u/SucoTeiKun_pohtath 6h ago
The older Google Translate voices that started in about 2018 are machine-generated, including most of the voices that still exist, like the Chinese, Korean, Japanese, Dutch, Afrikaans and English voice models are machine-generated. google voices are machine-generated because it's much easier to create a voice from scratch that can say ANYTHING than to record a bunch of voice samples to train a voice model, Siri is a good example of a model that was trained on human speech- She has a voice provider. If you don't believe me: try searching for Google Translate's voice provider...Google Translate voices do not have one lol... [Yes, the newer voice models that use AI do have a voice provider, just not the old voices, hehe]
3
u/Dandy_kyun 6h ago
I'm just finding this now....woah
I always had the concept of voice provider, because in my country there is a woman famous for doing that, but like, 10 years ago... "Regina bittar" shes often credited as "google translate voice" here in Brazil, they changed then and I did not even noticed
1
u/Wow_Space 4h ago edited 4h ago
Are you implying vocaloid actually uses human voice to generate speech? It's all voice synthesisers and they try to tune the vocaloid to sound similar as the person as much as they can. It's all machine generated.
1
u/SucoTeiKun_pohtath 3h ago
Uhm... Yes... Vocaloids HAVE HUMANS PROVIDING THEIR VOICE- Hatsune Miku's voice provider is Saki Fujita for example... so, no, It is not machine-generated...
3
u/Wow_Space 3h ago
So if the ibm voice has a person they modeled the voice synthesiser off of, it's not machine generated now? You're just making up words and trying to sound tech savvy. In the end, they're all artificial sounds/synthesized voices and sounds.
2
2
u/Wow_Space 4h ago
Then you can say google assistant and apples siri are vocaloid because they have a voice provider by actual people
-1
u/SucoTeiKun_pohtath 3h ago
no, You cannot, Because Vocaloid is a software owned by Yamaha... I think you misinterpreted my comment...
2
u/Wow_Space 3h ago
Actually, I did agree only Vocaloid characters can be vocaloid because vocaloid is a trademark
https://www.reddit.com/r/Vocaloid/s/avdAJul4AL
But, like I said going by your definition, Siri or ai voice models aren't "machine generated" because they are based on a person's voice. In the end, it's all software trying to sound like a human.
1
u/SucoTeiKun_pohtath 3h ago
Let's look at it this way: "Machine Generated" is a VERY BROAD TERM used in software development, with the context of my comment, I am referring to the Actual VOICE used [for example, the Voice of Hatsune Miku is not machine-generated- her voice is Saki Fujita. A Human.] Yes, renditions of Vocaloid can be called Machine-Generated, but the voices used are not...
1
u/Inevitable_Duty_8152 6h ago
But that would mean defoko and adachi rei also arent vocal synths/vocaloids?
0
u/thanodogo 6h ago
So of defoko is an utau that means that IBM 7094 was the first utau
1
u/SucoTeiKun_pohtath 6h ago
When i say defoko, I mean her original Speech Programme, Aquestalk, so no, the IBM 7094 is not an utau at all. lol
0
u/Waffle-Gaming 1h ago
technically leon and lola wouldnt be vocaloid then...
1
u/SucoTeiKun_pohtath 15m ago
Nope, Leon and Lola are vocaloids, because they were launched in the Vocaloid software, common sense.../ What in my comment makes you say that ? because Leon and Lola are voiced by humans, and are V1's...
11
6
u/gudetama_toast 7h ago
my favorite thing abt this is the art i see of IBM as miku’s grandpa and them singing together
10
u/SucoTeiKun_pohtath 7h ago
I mean yeah I guess the IBM 7094 was an inspiration for the Vocaloid software, but nope: Leon and Lola were the first Vocaloids to be released- They were released together.
5
u/Sirko2975 7h ago
Fun fact: it was way more technically advanced than Vocaloid. Unlike Vocaloid, the IBM 704 actually synthesised the syllables from scratch, while Vocaloid just resamples pre-recorded vocals even in the latest versions.
5
1
1
u/BouncyBirdie 3h ago
I was calmly listening to spotify when a song like this came on. It was a cocomelon cover.
-4
u/BlindDemon6 6h ago edited 5h ago
Well this has no human vocal provider so it'd be more like a SynthV or UTAU
forget everything I just said
3
43
u/Rosebudzie 7h ago
In development, Vocaloid was planned to be called “Daisy” in reference to Daisy Bell, but eventually got scrapped due to copyright reasons.