ByteArk

ホーム製品と成果会社イベント会社概要採用情報

AIの未来を切り拓く · 産業変革のための共創プラットフォーム

最先端の大規模言語モデル（LLM）とRAG技術で、ビジネスに新たな知能を。シームレスかつ即時に導入可能なAI統合基盤で、現実世界に革新的な価値をもたらします。

ソリューション＆実績

AI主導の次世代ビジネスソリューション

デジタルコマースの成長を加速する革新的AIサービス

AIレコメンデーションエンジン

一人ひとりに最適化された体験を創出するパーソナライズAI

独自アルゴリズムによる超精密ターゲティング
グローバルスケール対応のモジュール型推薦基盤
リアルタイム顧客インサイトで動的マーケティングを実現
業界最高水準のコンバージョン・エンゲージメント向上
持続的成長を支えるインテリジェントトラフィック管理

インテリジェントショッピングアシスタント

AIで高付加価値顧客を効率的かつ確実に獲得

AI駆動のターゲットプロファイリングで精度を最大化
自動最適化されたキーワード・キャンペーン運用
スマートターゲティングで獲得コストを業界最小化
即時活用可能な高度行動分析ダッシュボード
自律型A/BテストでマーケティングROIを最大化

AIコンテンツ生成

生成AIがもたらす圧倒的なクリエイティブ革新

数秒で高品質・高効果なマーケティングコピーを自動生成
ブランド一貫性と多言語対応を両立したメッセージ展開
ワンクリックで全チャネル・全シーンに最適化
人の創造力を解放し、制作コストと時間を大幅削減
業界標準を凌駕する生産性と柔軟性

AI推論アクセラレーション

ByteArk独自開発の超高性能LLM推論フレームワークは、DeepSeekモデル群に最適化。PD分離、EPLB（優先度スケジューリング）、DeepEP（並列実行）、DeepGEEM（メモリ最適化）など最先端技術を統合し、マルチGPU・マルチノード環境で50％超のスループット向上とレイテンシ半減を実現。グローバル規模のリアルタイムAI導入を可能にし、ミッションクリティカルな産業用途にも対応します。

グローバルデータセンターネットワーク

No.1 杭州データセンター

No.2 杭州データセンター

No.3 香港データセンター

No.4 シンガポールデータセンター

No.5 米国データセンター

テクノロジー＆インサイト

最新ブログを見る

Improving Word Embedding Models

This work presents an approach to improve text embedding models through contrastive fine-tuning on small datasets augmented with expert scores. It focuses on enhancing semantic textual similarity tasks and addressing text retrieval problems. The proposed method uses soft labels derived from expert-augmented scores to fine-tune embedding models, preserving their versatility and ensuring retrieval capability is improved. The paper evaluates the method using a Q&A dataset from an online shopping website and eight expert models. Results show improved performance over a benchmark model across multiple metrics on various retrieval tasks from the massive text embedding benchmark (MTEB). The method is cost-effective and practical for real-world applications, especially when labeled data is scarce. Table 1 and 2 present the evaluation of nDCG@10 and mAP@10 metrics, respectively, for different models across various datasets from MTEB retrieval tasks. The average nDCG@10 scores for Benchmark, Soft-1, Soft-2, and Hard label models are 39.675, 40.633, 40.334, and 37.574, respectively, with standard deviations of 29.963, 28.552, 28.167, and 27.081, respectively. And the average mAP@10 for Benchmark, Soft-1, Soft-2, and Hard label models are 34.419, 35.323, 35.04, and 32.243, respectively, with standard deviations of 29.693, 28.587, 28.221, and 26.585, respectively. The win rate of Soft-1 over the Benchmark is 50.37% in terms of nDCG@10, and is 55.38% with respect to mAP@10. This again confirms that no single text embedding method dominates across all tasks (Muennighoff et al., 2022). The Soft-1 and Soft-2 models demonstrate promising results with higher scores and smaller standard deviations compared to the Benchmark model, suggesting they perform well across various datasets and their performance is consistently stable. The Hard-label model, on the other hand, has worse nDCG@10 and mAP@10 scores compared to the Benchmark; although it has a smaller standard deviation. The improvement seen in the fine-tuning with Soft-1 and Soft-2 labels might be attributed to the reduced anisotropy in the fine-tuned models (meaning the text embeddings occupy a larger cone in the vector space after fine-tuning). This property is further supported by the results on the held-out set: the Soft-1 and Soft-2 models have better results in terms of area under precision-recall (PR) curve (see Section 4.3). The text embeddings of irrelevant pairs are then distributed across a wider range of the vector space. Pdf: https://arxiv.org/pdf/2408.11868

Large Language Model Compression

In this work, we tackle the critical challenge of compressing large language models (LLMs) to facilitate their practical deployment and broader adoption. We introduce a novel post-training compression paradigm that focuses on the low-rank decomposition of LLM weights. Our analysis identifies two main challenges in this task: the variability in LLM activation distributions and handling unseen activations from different datasets and models. To address these challenges, we propose a nested activation-aware framework (NSVD) for LLMs, a training-free approach designed to enhance the accuracy of low-rank decompositions by managing activation outliers by transforming the weight matrix based on activation distribution and the original weight matrix. This method allows for the absorption of outliers into the transformed weight matrix, improving decomposition accuracy. Our comprehensive evaluation across eight datasets and six models from three distinct LLM families demonstrates the superiority of NSVD over current state-of-the-art methods, especially at medium to large compression ratios or in multilingual and multitask settings. First, we evaluate the performance of LLaMA-7B compressed using NSVD (here, $k_1=0.95k$) and baselines under compression ratios ranging from 10% to 50% across all eight datasets. Our results include comparisons with ASVD-II, NSVD-I, and NSVD-II; since no improvements were observed using the proposed ASVD-III method, its results are omitted for brevity. Table 1 summarizes these findings. We observe that ASVD-I and ASVD-II yield equivalent performance when ignoring numerical errors. Similarly, NSVD-I and NSVD-II also produce comparable outcomes. NSVD-I or NSVD-II consistently outperforms standard SVD, ASVD-0, and ASVD-I across all the compression ratios. More importantly, NSVD exhibits significant advantages over baselines under medium to high compression ratios. Specifically, at a 30% compression ratio, compared to the best-performing baseline, NSVD-I reduces perplexity on PTB, C4, SNIPS, AlpacaEval, MCTest, CMRC (CN), and AlpacaEval (JP) by 7.1%, 5.4%, 12.1%, 6.3%, 1.3%, 16.1%, and 54.8%, respectively; when the compression ratio reaches 40%, NSVD can reduce perplexity by more than 60%. Pdf: https://arxiv.org/pdf/2503.17101

会社イベント

詳細を見る

2026-03-12

在字节方舟，每一个“周年”都值得被庆祝

今天不谈工作，只谈——你又陪字节方舟走过了这一年。在字节方舟，我们庆祝公司的成长，更庆祝你的成长。因为我们知道： **方舟之所以能乘风破浪，是因为有每一位船员在坚守岗位。** 今天，是一群小伙伴的“入职周年日”。一周年、两周年、三周年、四周年、五周年…… 每一个数字背后，都是一段关于陪伴、成长和坚持的故事。 **仪式感，从一份礼物开始** 好几天前，行政小姐姐就悄悄买好了周年礼物。在字节方舟，你的每一个重要时刻，我们都帮你记着。 ![IMG_5590.JPG](https://strapi.prod.greaterheat.com/uploads/IMG_5590_581c8856a9.JPG) **问答环节，最真挚的感受** 趁着大家领完礼物、状态放松的时候，我们问了几个简单的问题。每个人的答案都不一样，但听下来，最常出现的词是：感谢。有人说是“可以安心干活的地方”，有人说是“累过但值得的地方”，还有人想了想说：“就是那种，明天还想来的地方。” <div style="display: flex; flex-direction:row; gap: 10px; justify-content: center;"> <div style="flex:1"> <img src="https://strapi.prod.greaterheat.com/uploads/IMG_0485_2_95f5980a2d.JPG" alt="描述1" style="width:100%;"> </div> <div style="flex:1"> <img src="https://strapi.prod.greaterheat.com/uploads/IMG_0474_2_9bb0c5673f.JPG" alt="描述2" style="width:100%;"> </div> </div> **下午茶最简单的快乐** 既然是过节，怎么能少了吃的？今天的下午茶，格外的丰盛，休息区变成了“自助餐厅”。披萨、炸鸡、蛋糕…… 这些看似普通的食物，因为是和这群人一起吃，变得格外香。 <div style="display: flex; flex-direction:row; gap: 10px; justify-content: center;"> <div style="flex:1"> <img src="https://strapi.prod.greaterheat.com/uploads/IMG_0452_2_715e523463.jpg" alt="描述1" style="width:100%;"> </div> <div style="flex:1"> <img src="https://strapi.prod.greaterheat.com/uploads/IMG_0451_2_57c9f94e5f.jpg" alt="描述2" style="width:100%;"> </div> </div> 在字节方舟，我们相信： **好的公司不仅给员工发工资，更给员工“存在感”。** 一周年，你是初来乍到的新伙伴；两周年，你已经能独当一面；三周年、五周年……你成了方舟不可或缺的一部分。无论第几年，谢谢你，选择字节方舟。谢谢你，把这段时光交给我们。 **下一个周年，我们还要一起吃蛋糕、收礼物、笑着拍照。** 入职快乐，亲爱的字节方舟人！

AI主導の次世代ビジネスソリューション

AIレコメンデーションエンジン

インテリジェントショッピングアシスタント

AIコンテンツ生成

AI推論アクセラレーション

グローバルデータセンターネットワーク

ByteArkについて

コアバリュー

事実と知性に基づく意思決定

継続的学習とグローバル視点

傾聴と明快なコミュニケーション

成果主義と卓越への執念

誠実さと変革への勇気

知的財産

創業者

David

Andy

採用情報

推理引擎优化工程师

系统性能优化工程师