【OpenAIの新モデルリリースがリーク！サム・アルトマンがAGI、UBI、GPT-5、エージェントについて語る】英語解説を日本語で読む【2024年5月11日｜@Wes Roth】

2024年5月12日 00:03

この動画では、OpenAIの新しいモデルリリースに関する情報とサム・アルトマンのAI技術に対する見解を紹介しています。OpenAIは月曜日に新技術を発表する予定で、GPT-4 autoを参照するコードからエージェント能力やAI音声アシスタントに関連していると推測されています。リークによると、新モデルは音声認識とテキスト読み上げを統合し、画像とオーディオの入力をより良く理解することで、システムの速度と効率が向上するとのことです。サム・アルトマンは、新しいモデルのリリース方法やGPT-4の改善について言及し、音声がコンピューターを使う新しい方法になる可能性を示唆しました。また、自分自身の拡張としてのエージェントと、優秀な上級社員としてのエージェントという2つのアプローチについて語りました。最後に、アルトマンはユニバーサルベーシックコンピュートという概念を提示し、従来のユニバーサルベーシックインカムよりも良い方法になるかもしれないと示唆しました。未来では、誰もがAIコンピュートの一部を得て、それを使ったり、再販したり、研究のために寄付したりできるようになるかもしれません。
公開日：2024年5月11日
※動画を再生してから読むのがオススメです。

There's been some crazy news happening over the last 24 hours, most of them surrounding OpenAI and the new thing that they're about to unveil.

ここ24時間の間に、主にOpenAIとその新しい発表を取り巻くクレイジーなニュースがありました。

So first of all, there's a lot of speculation as to what the new release will be.

まず第一に、新しいリリースが何であるかについて多くの憶測があります。

OpenAI is saying that they will have a live stream on OpenAI.com at 10am Pacific Monday to demo some updates.

OpenAIは月曜日の太平洋時間午前10時にOpenAI.comでライブストリームを行い、いくつかのアップデートをデモすると言っています。

Some outlook clarifies it's not GPT-5, it's not a search engine.

いくつかの見通しでは、GPT-5ではなく、検索エンジンでもないことを明確にしています。

There were a number of things pointing to potentially there being a search engine and we do believe that one is in the works.

検索エンジンの可能性を示唆するいくつかのことがあり、私たちはそれが開発中だと信じています。

But this isn't it.

しかし、これはそうではありません。

And Simotman's saying it feels like magic to him, whatever this new thing is.

そして、シモットマンは、この新しいものが何であれ、自分にとって魔法のように感じると言っています。

There are some unconfirmed leaks that there's some code that references GPT-4 auto.

GPT-4 autoを参照するいくつかのコードがあるという未確認のリークがあります。

What could auto mean?

autoとは何を意味するのでしょうか？

Could it be agentic capabilities, AI agents?

エージェント能力、AIエージェントである可能性はあるのでしょうか？

I'm guessing that the new breakthroughs have to do with voice.

私は、新しいブレークスルーは音声に関係していると推測しています。

But more specifically with voice agents.

しかし、より具体的には音声エージェントに関係しています。

OpenAI develops AI voice assistant as it chases Google and Apple.

OpenAIはGoogleとAppleを追いかけてAI音声アシスタントを開発しています。

You might think this was from a while ago because we already had kind of voice assistants in GPT-4, but this is main tent just a few days before the new announcement.

GPT-4ですでに音声アシスタントのようなものがあったので、これはしばらく前のことだと思うかもしれませんが、これは新しい発表の数日前のメインテントです。

We'll come back to this in a second.

これについてはすぐに戻ります。

But the point is this, since a number of people have mentioned that agents will be a potential focus at this streaming live event.

しかしポイントはこれです。多くの人がエージェントがこのストリーミングライブイベントの潜在的な焦点になるだろうと言及しているので。

But getting back to this, OpenAI develops AI voice assistants.

しかし、AI音声アシスタントの開発に戻ると、

We know some people talked about seeing the next iteration of the GPT model, whether that's call a GPT-5 or something else, and they're saying it's going to include better reasoning.

GPTモデルの次のイテレーションを見た人がいて、それがGPT-5と呼ばれるか何かわかりませんが、より良い推論が含まれると言っているのを知っています。

And they're saying this would be the next step in developing something like the virtual assistant in the movie Her.

そして、これは映画「Her」のような仮想アシスタントを開発する次のステップになるだろうと言っています。

These assistants would be able to act like a tutor for kids working on various papers or math problems, but also as automated customer service agents.

これらのアシスタントは、さまざまな論文や数学の問題に取り組む子供たちのチューターとして機能することができますが、自動化されたカスタマーサービスエージェントとしても機能します。

And this sentence stands out to me as why potentially this could be an on-device model.

そして、この文は、なぜこれがオンデバイスモデルになる可能性があるかという点で私の目を引きます。

Microsoft could potentially use OpenAI's new AI to improve its own voice assistant or try to make it compact enough to run on small devices, including wearables with front facing cameras that can capture the customer's surrounding.

Microsoftは、OpenAIの新しいAIを使用して独自の音声アシスタントを改善したり、顧客の周囲をキャプチャできるフロントカメラを備えたウェアラブルデバイスを含む小さなデバイスで実行できるように十分にコンパクトにすることができます。

And just as I was about to wrap up the video, somebody posted this.

そして、ビデオを終了しようとしていたちょうどそのとき、誰かがこれを投稿しました。

This is potentially a leak of what's coming on Monday.

これは、月曜日に来るものの潜在的なリークです。

Again, we don't know the source yet, but it sounds like this will be published through some credible source later today.

繰り返しになりますが、まだソースはわかりませんが、今日の後半に信頼できるソースから公開されるようです。

They're saying they expect AI voice assistants with enhanced capabilities.

彼らは、機能が向上したAI音声アシスタントを期待していると言っています。

Again, comparing it to the movie Her, the integration of audio and visual features in one model, unlike previous models which treated audio transcription and text to speech as separate features, the new model integrates these capabilities.

再び映画「Her」と比較して、以前のモデルが音声の文字起こしとテキスト読み上げを別々の機能として扱っていたのとは異なり、新しいモデルはこれらの機能を統合しているオーディオとビジュアル機能の統合です。

This integration allows the model to have a better understanding of both image and audio inputs, enhancing the overall speed and efficiency of the system.

この統合により、モデルは画像とオーディオの両方の入力をよりよく理解でき、システム全体の速度と効率が向上します。

This will be rolling out on Monday.

これは月曜日にリリースされます。

And initially, this new technology will be available via the cloud, targeting uses such as improving automated customer service.

そして当初、この新しいテクノロジーはクラウドを介して利用可能になり、自動化されたカスタマーサービスの改善などの用途をターゲットにしています。

The technology is too large to run on personal devices yet, but could be integrated into devices via cloud computing in the near term.

このテクノロジーはまだ個人のデバイスで実行するには大きすぎますが、近い将来、クラウドコンピューティングを介してデバイスに統合できる可能性があります。

There's also mention of GPT-5, which is expected to be a significant improvement over GPT-4 and might be released by the end of the year.

また、GPT-4から大幅な改善が期待されるGPT-5についても言及されており、年末までにリリースされる可能性があります。

And also we might be getting a new pricing tier that offers up to a 50% discount for customers who prepay, as well as the potential integration of Apple and the iPhone, which we covered earlier on this channel.

また、事前に支払った顧客に最大50％の割引を提供する新しい価格帯や、以前このチャンネルで取り上げたAppleとiPhoneの統合の可能性もあります。

Sam Albin just went on the All In podcast and helped answer some questions about AGI, the upcoming model releases, and just some basic thoughts about where this whole AI thing is going.

サム・アルビンは、All In podcastに出演し、AGI、今後のモデルリリース、そしてこのAIがどこへ向かっているのかについていくつかの基本的な考えについて質問に答えてくれました。

Here are just a few quick clips that kind of stood out to me that I thought were interesting.

私が面白いと思った、少し目立ったクリップをいくつかご紹介します。

Like, we may release it in a different way than we've released previous models.

以前のモデルとは違ったやり方でリリースするかもしれません。

Also, I don't even know if we'll call it GPT-5.

GPT-5と呼ぶかどうかもわかりません。

What I will say is a lot of people have noticed how much better GPT-4 has gotten since we've released it, and particularly over the last few months.

言えることは、多くの人がGPT-4がリリース以来、特にここ数ヶ月でどれだけ良くなったかに気づいていることです。

I think that's like a better hint of what the world looks like, where it's not the like one, two, three, four, five, six, seven, but you just use an AI system and the whole system just gets better and better fairly continuously.

それは、1、2、3、4、5、6、7のようではなく、AIシステムを使うだけで、システム全体がかなり継続的にどんどん良くなっていく世界がどのようなものかを示唆していると思います。

I think that's like both a better technological direction.

それは技術的により良い方向性だと思います。

I think that's like easier for society to adapt to.

社会がそれに適応するのが容易だと思います。

But I assume that's where we'll head.

しかし、私たちはそこに向かうと思います。

I will say that I expect lots of very capable models in the world, and like, it feels to me like we just like stumbled on a new fact of nature or science or whatever you want to call it, which is like we can create.

世界には非常に有能なモデルがたくさんあると言えます。自然や科学、あるいは何と呼ぶにしても、新しい事実にぶつかったような気がします。

You can like, I mean, I don't believe this literally, but it's like a spiritual point.

文字通りではないと思いますが、精神的な点です。

You know, intelligence is just this emergent property of matter, and that's like a rule of physics or something.

知性は物質の創発特性のようなものであり、それは物理学の法則のようなものだと言えます。

You mentioned sort of the models that can fit on a phone.

電話に収まるモデルについて少し触れましたね。

So obviously, whether that's an LLM or some SLM or something, I'm sure you're thinking about that.

それが大規模言語モデルであろうとSLMであろうと、あなたはそれについて考えていると確信しています。

But then does the device itself change?

しかし、デバイス自体は変わるのでしょうか？

I mean, does it need to be as expensive as an iPhone?

iPhone ほど高価である必要はありますか？

I'm super interested in this.

私はこれにとても興味があります。

I love like great new form factors of computing, and it feels like with every major technological advance a new thing becomes possible.

新しいコンピューティングのフォームファクターが大好きで、大きな技術的進歩があるたびに新しいことが可能になるように感じます。

iPhones are unbelievably good.

iPhoneは信じられないほど良いです。

So I think the threshold is like very high here.

だから、ここでのハードルはとても高いと思います。

Like, I think like I personally think iPhone is like the greatest piece of technology humanity has ever made.

個人的には、iPhoneは人類が今まで作った最高の技術だと思います。

It's really a wonderful product.

本当に素晴らしい製品です。

And I think voice is a hint to whatever the next thing is, like if you can get voice interaction to be really good, it feels I think that feels like a different way to use a computer.

そして、音声は次のものへのヒントだと思います。音声のやり取りを本当に良いものにできれば、コンピューターの使い方が違うように感じます。

I think what I want is just this always on like super low friction thing where I can either by voice or by text or ideally like some other it just kind of knows what I want have this like constant thing helping me throughout my day that's got like as much context on as possible.

私が欲しいのは、常にオンで、超低摩擦で、音声やテキストで、あるいは理想的にはある種の方法で私が何を望んでいるかを知っている、一日中私を助けてくれる常にあるものです。

It's like the world's greatest assistant.

それは世界最高のアシスタントのようなものです。

And it's just this like thing working to make me better and better.

そして、それは私をどんどん良くしてくれるものなのです。

There's sort of two different approaches and they don't sound that different, but I think they're like very different for how we'll design the system in practice.

2つのアプローチがあり、それほど違わないように聞こえますが、実際にシステムを設計する上で大きく異なると思います。

There's the I want an extension of myself, a ghost or an alter ego or this thing that really like is me is acting on my behalf.

1つは自分自身の拡張、幽霊、別人格、本当に自分であり、自分に代わって行動するものが欲しいというものです。

And then there's this other thing which is like, I want a great senior employee.

そしてもう1つは、素晴らしい上級社員が欲しいというものです。

It may get to know me very well.

私のことをよく知ることになるかもしれません。

I may delegate it.

私はそれに委任するかもしれません。

You know, you can like have access to my email and I'll tell you the constraints but but I think of it as this like separate entity executive agent in a way like it's out there working on your behalf.

私のメールにアクセスできるようにして、制約を伝えますが、ある意味では別のエンティティ、エグゼクティブエージェントだと考えています。自分に代わって働いてくれるものです。

I think that the agent like behavior but there's like a difference between a senior employee and an agent.

エージェントのような振る舞いがありますが、上級社員とエージェントには違いがあります。

And like I want it.

そして、私はそれが欲しいのです。

You know, I think of like my I think like a bit like one of the things that I like about a senior employee is they'll they'll push back on me.

上級社員が好きなことの1つは、彼らが私に反論してくれることだと思います。

They will sometimes not do something I ask or they're sometimes will say like I can do that thing if you want.

時々、私が求めたことをしなかったり、時々、私がそれをしてほしければできると言ったりします。

But if I do it, here's what I think would happen.

しかし、もしそうするなら、こうなると思います。

And then this and then that.

そしてこれ、そしてそれ。

And are you really sure?

本当にそれでいいのですか？

I definitely want that kind of vibe, which not not just like this thing that I ask and it does.

確かにそんな雰囲気が欲しいですね。私が求めて、それがするというだけではなく。

It can reason.

それは理由づけることができます。

And push back.

そして反論する。

It has like the kind of relationship with me that I would expect out of a really competent person that I worked with, which is different from like a sycophant.

私が一緒に働く本当に有能な人に期待するような、私との関係があります。それは、おべっか使いとは違います。

A lot of hand wringing right now, Sam, about jobs.

サム、今は仕事について多くの心配があります。

You had a lot of I think did like some sort of a testament you wrote YC about UBI and you've been.

YCでUBIについて多くの証言をしたと思います。

Our results in that come out very soon.

その結果はすぐに出ます。

I just it was a five year study that wrapped up.

5年間の研究が終了したばかりです。

So we started thinking about this in 2016, but at the same time, started thinking AI really seriously.

2016年に真剣にAIについて考え始めたのと同時に、これについて考え始めました。

And the theory was that the magnitude of the change that may come to society and jobs in the economy and sort of in some deeper sense than that, like what the social contract looks like meant that we should have many studies to study many ideas about new ways to arrange that.

社会や雇用、経済に訪れるかもしれない変化の大きさ、さらに深い意味で、社会契約がどのように見えるかということは、それを整理する新しい方法について多くのアイデアを研究するために多くの研究をすべきだという理論でした。

I also think that I'm not like a super fan of how the government has handled most policies designed to help poor people.

また、私は、貧しい人々を助けるために設計されたほとんどの政策に対する政府の対応方法のスーパーファンではありません。

And I kind of believe that if you could just give people money, they would make good decisions and the market would do its thing.

そして、人々にお金を与えるだけで、彼らは良い決断をし、市場がその役割を果たすと信じています。

And I'm very much in favor of lifting up the floor and reducing eliminating poverty.

そして、私は床を持ち上げ、貧困を減らすこと、なくすことに非常に賛成です。

But I'm interested in better ways to do that.

しかし、それを行うためのより良い方法に興味があります。

So 2016 was very long time ago.

2016年は非常に昔のことでした。

Now that we see some of the ways that AI is developing, I wonder if there's better things to do than the traditional conceptualization of UBI.

AIがどのように発展しているかを見ると、従来のUBIの概念よりも良いことができるのではないかと思います。

Like, I wonder, I wonder if the future looks something like more like universal basic compute than universal basic income.

将来は、ユニバーサルベーシックインカムというよりは、ユニバーサルベーシックコンピュートのようになるのではないでしょうか。

And everybody gets like a slice of GPT-7s compute and they can use it, they can resell it, they can donate it to somebody to use for cancer research.

そして、誰もがGPT-7のコンピュートの一部を得て、それを使うことができ、再販したり、がん研究に使うために誰かに寄付したりできます。

But but what you get is not dollars, but this like product slice.

しかし、得るものはドルではなく、この製品のスライスのようなものです。

You own like part of the productivity.

生産性の一部を所有しているのです。

People talk a lot about agents as if there's kind of this linear set of call functions that happen.

人々はエージェントについて、一種の線形の呼び出し関数のセットがあるかのように多く語ります。

But one of the things that arises in biology is networks of systems that have cross interactions that the aggregation of the system, the aggregation of the network produces an output rather than one thing calling another, that thing calling another.

しかし、生物学で生じる事柄の1つは、1つのものが別のものを呼び出し、その別のものがさらに別のものを呼び出すのではなく、相互作用のあるシステムのネットワークが、システムの集約、ネットワークの集約が出力を生み出すということです。

Do we see like an emergence in this architecture of either specialized models or network models that work together to address bigger problem sets, use reasoning?

より大きな問題セットに取り組むために協力する、専門化されたモデルやネットワークモデルのこのアーキテクチャの出現を見ているのでしょうか、推論を使用しますか？

I don't know how much reasoning is going to turn out to be a super generalizable thing.

推論がどの程度一般化可能なものになるかはわかりません。

I suspect it will, but that's more just like an intuition and a hope.

そうなることを疑っていますが、それはより直感的で希望的なものです。

And it would be nice if it worked out that way.

そういうふうにうまくいけばいいのですが。

You clearly will need specialized simulators, connectors, pieces of data, whatever.

明らかに、特殊なシミュレーター、コネクタ、データの一部などが必要になるでしょう。

But my intuition and again, I don't have this like backed up with science.

しかし、私の直感では、これを科学的に裏付けているわけではありません。

My intuition would be if we can Figure out the core of generalized reasoning, connecting that to new problem domains in the same way that humans are generalized reasoners would, I think, be doable.

一般化された推論の核心を理解できれば、人間が一般化された推論者であるのと同じように、新しい問題領域とつなげることは可能だと思います。

It's like a fast unlock.

それは速いアンロックのようなものです。

Faster unlock.

より速いアンロック。

I think so.

そう思います。

With that said, my name is Wes Roth and thank you for watching.

それでは、私の名前はウェス・ロスです。ご視聴ありがとうございました。

この記事が気に入ったらサポートをしてみませんか？