{"componentChunkName":"component---src-templates-acg-portal-new-template-tsx","path":"/tmqeuziip","result":{"data":{"markdownRemark":{"html":"<h2 id=\"简介\"><a href=\"#%E7%AE%80%E4%BB%8B\" aria-label=\"简介 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>简介</h2>\n<p>基于pyannote-audio的说话人分离处理器</p>\n<h3 id=\"功能描述\"><a href=\"#%E5%8A%9F%E8%83%BD%E6%8F%8F%E8%BF%B0\" aria-label=\"功能描述 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>功能描述</h3>\n<ul>\n<li>多说话人语音分离与时间戳标注</li>\n<li>输出带说话人标签的语音分段元数据</li>\n<li>输入：音频</li>\n<li>输出：包含以下字段的结构化数据</li>\n<li>speaker: 说话人唯一标识</li>\n<li>start: 语音段开始时间（秒）</li>\n<li>end: 语音段结束时间（秒）</li>\n<li>使用<code>speaker-diarization-3.1</code>说话人分离模型</li>\n<li>支持GPU加速推理（需配置CUDA环境）</li>\n</ul>\n<h2 id=\"算子参数\"><a href=\"#%E7%AE%97%E5%AD%90%E5%8F%82%E6%95%B0\" aria-label=\"算子参数 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>算子参数</h2>\n<h3 id=\"输入\"><a href=\"#%E8%BE%93%E5%85%A5\" aria-label=\"输入 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>输入</h3>\n<table>\n<thead>\n<tr>\n<th>输入</th>\n<th>含义</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>audios</td>\n<td>包含多个音频数据的数组</td>\n</tr>\n</tbody>\n</table>\n<h3 id=\"输出\"><a href=\"#%E8%BE%93%E5%87%BA\" aria-label=\"输出 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>输出</h3>\n<table>\n<thead>\n<tr>\n<th>输出</th>\n<th>含义</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>result</td>\n<td>pa.Array: 包含说话人分离结果的数组</td>\n</tr>\n</tbody>\n</table>\n<h3 id=\"参数\"><a href=\"#%E5%8F%82%E6%95%B0\" aria-label=\"参数 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>参数</h3>\n<table>\n<thead>\n<tr>\n<th>参数名称</th>\n<th>类型</th>\n<th>默认值</th>\n<th>描述</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td>model_path</td>\n<td>str</td>\n<td>'/opt/aihc/models'</td>\n<td>模型文件所在的路径</td>\n</tr>\n<tr>\n<td>rank</td>\n<td>int</td>\n<td>0</td>\n<td>用于指定使用的 GPU 设备编号</td>\n</tr>\n</tbody>\n</table>\n<h2 id=\"调用示例\"><a href=\"#%E8%B0%83%E7%94%A8%E7%A4%BA%E4%BE%8B\" aria-label=\"调用示例 permalink\" class=\"anchor\"><svg aria-hidden=\"true\" focusable=\"false\" height=\"16\" version=\"1.1\" viewBox=\"0 0 16 16\" width=\"16\"><path fill-rule=\"evenodd\" d=\"M4 9h1v1H4c-1.5 0-3-1.69-3-3.5S2.55 3 4 3h4c1.45 0 3 1.69 3 3.5 0 1.41-.91 2.72-2 3.25V8.59c.58-.45 1-1.27 1-2.09C10 5.22 8.98 4 8 4H4c-.98 0-2 1.22-2 2.5S3 9 4 9zm9-3h-1v1h1c1 0 2 1.22 2 2.5S13.98 12 13 12H9c-.98 0-2-1.22-2-2.5 0-.83.42-1.64 1-2.09V6.25c-1.09.53-2 1.84-2 3.25C6 11.31 7.55 13 9 13h4c1.45 0 3-1.69 3-3.5S14.5 6 13 6z\"></path></svg></a>调用示例</h2>\n\n    <div class=\"code-block-wrapper\">\n        <div class=\"code-block\">\n            <div class=\"code-block-header\">\n                <span class=\"code-block-name\">Python</span>\n                <button class=\"code-copy-btn\" data-tooltip-text=\"\">\n                    <svg xmlns=\"http://www.w3.org/2000/svg\" width=\"16\" height=\"16\" viewBox=\"0 0 16 16\" fill=\"none\"> <path fill-rule=\"evenodd\" clip-rule=\"evenodd\" d=\"M5.57894 3.45614C5.57894 3.38832 5.63392 3.33333 5.70175 3.33333H12.5439C12.6117 3.33333 12.6667 3.38832 12.6667 3.45614V10.2982C12.6667 10.3661 12.6117 10.4211 12.5439 10.4211H11.7544V5.70175C11.7544 4.89754 11.1025 4.24561 10.2982 4.24561H5.57894V3.45614ZM4.24561 4.24561V3.45614C4.24561 2.65194 4.89754 2 5.70175 2H12.5439C13.3481 2 14 2.65194 14 3.45614V10.2982C14 11.1025 13.3481 11.7544 12.5439 11.7544H11.7544V12.5439C11.7544 13.3481 11.1025 14 10.2982 14H3.45614C2.65194 14 2 13.3481 2 12.5439V5.70175C2 4.89754 2.65194 4.24561 3.45614 4.24561H4.24561ZM3.33333 5.70175C3.33333 5.63392 3.38832 5.57894 3.45614 5.57894H10.2982C10.3661 5.57894 10.4211 5.63392 10.4211 5.70175V12.5439C10.4211 12.6117 10.3661 12.6667 10.2982 12.6667H3.45614C3.38832 12.6667 3.33333 12.6117 3.33333 12.5439V5.70175Z\" fill=\"currentColor\"></path> </svg>\n                    复制\n                </button>\n            </div>\n            <div class=\"code-block-content\">\n                <pre class=\"language-python\"><code><span class=\"line-number\">1</span><span class=\"token keyword\">from</span> __future__ <span class=\"token keyword\">import</span> annotations\n<span class=\"line-number\">2</span>\n<span class=\"line-number\">3</span><span class=\"token keyword\">import</span> os\n<span class=\"line-number\">4</span>\n<span class=\"line-number\">5</span><span class=\"token keyword\">import</span> daft\n<span class=\"line-number\">6</span><span class=\"token keyword\">from</span> daft <span class=\"token keyword\">import</span> col\n<span class=\"line-number\">7</span>\n<span class=\"line-number\">8</span><span class=\"token keyword\">from</span> daft<span class=\"token punctuation\">.</span>aihc<span class=\"token punctuation\">.</span>common<span class=\"token punctuation\">.</span>udf <span class=\"token keyword\">import</span> aihc_udf\n<span class=\"line-number\">9</span><span class=\"token keyword\">from</span> daft<span class=\"token punctuation\">.</span>aihc<span class=\"token punctuation\">.</span>functions<span class=\"token punctuation\">.</span>audio<span class=\"token punctuation\">.</span>audio_speaker_diarization <span class=\"token keyword\">import</span> AudioSpeakerDiarization\n<span class=\"line-number\">10</span>\n<span class=\"line-number\">11</span><span class=\"token keyword\">if</span> __name__ <span class=\"token operator\">==</span> <span class=\"token string\">\"__main__\"</span><span class=\"token punctuation\">:</span>\n<span class=\"line-number\">12</span>    <span class=\"token keyword\">if</span> os<span class=\"token punctuation\">.</span>getenv<span class=\"token punctuation\">(</span><span class=\"token string\">\"DAFT_RUNNER\"</span><span class=\"token punctuation\">,</span> <span class=\"token string\">\"native\"</span><span class=\"token punctuation\">)</span> <span class=\"token operator\">==</span> <span class=\"token string\">\"ray\"</span><span class=\"token punctuation\">:</span>\n<span class=\"line-number\">13</span>        <span class=\"token keyword\">import</span> ray\n<span class=\"line-number\">14</span>        ray<span class=\"token punctuation\">.</span>init<span class=\"token punctuation\">(</span>dashboard_host<span class=\"token operator\">=</span><span class=\"token string\">\"0.0.0.0\"</span><span class=\"token punctuation\">,</span> ignore_reinit_error<span class=\"token operator\">=</span><span class=\"token boolean\">True</span><span class=\"token punctuation\">)</span>\n<span class=\"line-number\">15</span>        daft<span class=\"token punctuation\">.</span>set_runner_ray<span class=\"token punctuation\">(</span><span class=\"token punctuation\">)</span>\n<span class=\"line-number\">16</span>    daft<span class=\"token punctuation\">.</span>set_execution_config<span class=\"token punctuation\">(</span>actor_udf_ready_timeout<span class=\"token operator\">=</span><span class=\"token number\">6000</span><span class=\"token punctuation\">,</span> min_cpu_per_task<span class=\"token operator\">=</span><span class=\"token number\">0</span><span class=\"token punctuation\">)</span>\n<span class=\"line-number\">17</span>\n<span class=\"line-number\">18</span>    <span class=\"token comment\"># TODO: 根据实际场景准备样本数据</span>\n<span class=\"line-number\">19</span>    samples <span class=\"token operator\">=</span> <span class=\"token punctuation\">{</span><span class=\"token string\">\"audios\"</span><span class=\"token punctuation\">:</span> <span class=\"token punctuation\">[</span><span class=\"token punctuation\">.</span><span class=\"token punctuation\">.</span><span class=\"token punctuation\">.</span><span class=\"token punctuation\">]</span><span class=\"token punctuation\">}</span>\n<span class=\"line-number\">20</span>    ds <span class=\"token operator\">=</span> daft<span class=\"token punctuation\">.</span>from_pydict<span class=\"token punctuation\">(</span>samples<span class=\"token punctuation\">)</span>\n<span class=\"line-number\">21</span>    constructor_kwargs <span class=\"token operator\">=</span> <span class=\"token punctuation\">{</span>\n<span class=\"line-number\">22</span>        <span class=\"token string\">\"model_path\"</span><span class=\"token punctuation\">:</span> <span class=\"token string\">'/opt/aihc/models'</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">23</span>        <span class=\"token string\">\"rank\"</span><span class=\"token punctuation\">:</span> <span class=\"token number\">0</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">24</span>    <span class=\"token punctuation\">}</span>\n<span class=\"line-number\">25</span>    ds <span class=\"token operator\">=</span> ds<span class=\"token punctuation\">.</span>with_column<span class=\"token punctuation\">(</span>\n<span class=\"line-number\">26</span>        <span class=\"token string\">\"result\"</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">27</span>        aihc_udf<span class=\"token punctuation\">(</span>\n<span class=\"line-number\">28</span>            AudioSpeakerDiarization<span class=\"token punctuation\">,</span>\n<span class=\"line-number\">29</span>            construct_args<span class=\"token operator\">=</span>constructor_kwargs<span class=\"token punctuation\">,</span>\n<span class=\"line-number\">30</span>            num_cpus<span class=\"token operator\">=</span><span class=\"token number\">1</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">31</span>            concurrency<span class=\"token operator\">=</span><span class=\"token number\">4</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">32</span>            batch_size<span class=\"token operator\">=</span><span class=\"token number\">8</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">33</span>        <span class=\"token punctuation\">)</span><span class=\"token punctuation\">(</span>col<span class=\"token punctuation\">(</span><span class=\"token string\">\"audios\"</span><span class=\"token punctuation\">)</span><span class=\"token punctuation\">)</span><span class=\"token punctuation\">,</span>\n<span class=\"line-number\">34</span>    <span class=\"token punctuation\">)</span>\n<span class=\"line-number\">35</span>    ds<span class=\"token punctuation\">.</span>show<span class=\"token punctuation\">(</span><span class=\"token punctuation\">)</span></code></pre>\n            </div>\n        </div>\n    </div>\n  ","fields":{"slug":"tmqeuziip","title":"说话人分离","date":"2026-06-15","extractedHeadings":[]},"headings":[{"value":"简介","depth":2},{"value":"功能描述","depth":3},{"value":"算子参数","depth":2},{"value":"输入","depth":3},{"value":"输出","depth":3},{"value":"参数","depth":3},{"value":"调用示例","depth":2}]}},"pageContext":{"isCreatedByStatefulCreatePages":false,"slug":"tmqeuziip","prev":{"id":"2mqeuzici","name":"音频人声分离","path":"2mqeuzici","filePath":"操作指南/AI数据处理/算子列表/音频/音频人声分离.md","seo":null,"parentIds":["ilib2qygp","Ymo88m8hi","Imob3m6so","zmob46e1c"],"parents":[{"id":"ilib2qygp","documentId":"bfa43a8b-968a-41a1-8c9d-906507eeaed9","name":"操作指南","repoName":"AIHC","filePath":"操作指南","disabled":false,"path":"ilib2qygp","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":null},{"id":"Ymo88m8hi","documentId":"c8cb5e38-f8c5-40f4-a424-b0c7895f0c0a","name":"AI数据处理","repoName":"AIHC","filePath":"操作指南/AI数据处理","disabled":false,"path":"Ymo88m8hi","lastMergeTime":"2026-04-21 14:23:10","isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"Imob3m6so","documentId":"fe548e34-6659-4ff5-86f6-eee2c43aec90","name":"算子列表","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表","disabled":false,"path":"Imob3m6so","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"zmob46e1c","documentId":"886f1d97-777e-4f0c-81ae-e69bf16653f0","name":"音频","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表/音频","disabled":false,"path":"zmob46e1c","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""}]},"next":{"id":"6mqeuziul","name":"语音识别(Whisper)","path":"6mqeuziul","filePath":"操作指南/AI数据处理/算子列表/音频/语音识别(Whisper).md","seo":null,"parentIds":["ilib2qygp","Ymo88m8hi","Imob3m6so","zmob46e1c"],"parents":[{"id":"ilib2qygp","documentId":"bfa43a8b-968a-41a1-8c9d-906507eeaed9","name":"操作指南","repoName":"AIHC","filePath":"操作指南","disabled":false,"path":"ilib2qygp","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":null},{"id":"Ymo88m8hi","documentId":"c8cb5e38-f8c5-40f4-a424-b0c7895f0c0a","name":"AI数据处理","repoName":"AIHC","filePath":"操作指南/AI数据处理","disabled":false,"path":"Ymo88m8hi","lastMergeTime":"2026-04-21 14:23:10","isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"Imob3m6so","documentId":"fe548e34-6659-4ff5-86f6-eee2c43aec90","name":"算子列表","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表","disabled":false,"path":"Imob3m6so","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"zmob46e1c","documentId":"886f1d97-777e-4f0c-81ae-e69bf16653f0","name":"音频","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表/音频","disabled":false,"path":"zmob46e1c","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""}]},"parents":[{"id":"ilib2qygp","documentId":"bfa43a8b-968a-41a1-8c9d-906507eeaed9","name":"操作指南","repoName":"AIHC","filePath":"操作指南","disabled":false,"path":"ilib2qygp","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":null},{"id":"Ymo88m8hi","documentId":"c8cb5e38-f8c5-40f4-a424-b0c7895f0c0a","name":"AI数据处理","repoName":"AIHC","filePath":"操作指南/AI数据处理","disabled":false,"path":"Ymo88m8hi","lastMergeTime":"2026-04-21 14:23:10","isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"Imob3m6so","documentId":"fe548e34-6659-4ff5-86f6-eee2c43aec90","name":"算子列表","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表","disabled":false,"path":"Imob3m6so","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""},{"id":"zmob46e1c","documentId":"886f1d97-777e-4f0c-81ae-e69bf16653f0","name":"音频","repoName":"AIHC","filePath":"操作指南/AI数据处理/算子列表/音频","disabled":false,"path":"zmob46e1c","lastMergeTime":null,"isApiDoc":null,"httpMethod":null,"seo":null,"sourceOrgName":null,"sourceRepoName":null,"sourceDocumentId":""}],"specificSeo":null}}}