Research Achievements

1. Journal Papers (in English)

Yukimasa Sugizaki and Daisuke Takahashi: Improved Modular Multiplication Algorithms Using Solely IEEE 754 Binary Floating-Point Operations, IEEE Transactions on Emerging Topics in Computing, Vol. 13, No. 3, pp. 1259-1271 (2025).
Takuya Edamatsu and Daisuke Takahashi: Fast Multiple-Precision Integer Division Using Intel AVX-512, IEEE Transactions on Emerging Topics in Computing, Vol. 11, No. 1, pp. 224-236 (2023).
Daisuke Takahashi: On the use of Montgomery multiplication in the computation of binary BBP-type formulas for mathematical constants, The Ramanujan Journal, Vol. 59, No. 1, pp. 211-219 (2022).
Yukimasa Sugizaki and Daisuke Takahashi: A Fast Algorithm for Computing the Number of Magic Series, Annals of Combinatorics, Vol. 26, No. 2, pp. 511-532 (2022).
Kazuhiko Komatsu, Ayumu Gomi, Ryusuke Egawa, Daisuke Takahashi, Reiji Suda, and Hiroyuki Takizawa: Xevolver: A code transformation framework for separation of system-awareness from application codes, Concurrency and Computation: Practice and Experience, Vol. 32, No. 7, e5577 (2020).
Daisuke Takahashi: On the computation and verification of π using BBP-type formulas, The Ramanujan Journal, Vol. 51, No. 1, pp. 177-186 (2020).
Takahiro Katagiri and Daisuke Takahashi: Japanese Autotuning Research: Autotuning Languages and FFT, Proceedings of the IEEE, Vol. 106, No. 11, pp. 2056-2067 (2018). (invited paper)
Daisuke Takahashi: Computation of the 100 quadrillionth hexadecimal digit of π on a cluster of Intel Xeon Phi processors, Parallel Computing, Vol. 75, pp. 1-10 (2018).
Yukihiro Hasegawa, Jun-Ichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Hikaru Inoue, Yoshito Kitazawa, Ikuo Miyoshi, and Mitsuo Yokokawa: Performance evaluation of ultra-large-scale first-principles electronic structure calculation code on the K computer, International Journal of High Performance Computing Applications, Vol. 28, No. 3, pp. 335-355 (2014).
Yutaka Maruyama, Norio Yoshida, Hiroto Tadano, Daisuke Takahashi, Mitsuhisa Sato, and Fumio Hirata: Massively parallel implementation of 3D-RISM calculation with volumetric 3D-FFT, Journal of Computational Chemistry, Vol. 35, No. 18, pp. 1347-1355 (2014).
Yohei Miki, Daisuke Takahashi, and Masao Mori: Highly scalable implementation of an N-body code on a GPU cluster, Computer Physics Communications, Vol. 184, No. 9, pp. 2159-2168 (2013).
Daisuke Takahashi: Parallel implementation of multiple-precision arithmetic and 2,576,980,370,000 decimal digits of π calculation, Parallel Computing, Vol. 36, No. 8, pp. 439-448 (2010).
Yoshikuni Sato, Daisuke Takahashi, and Reijer Grimbergen: A Shogi Program Based on Monte-Carlo Tree Search, ICGA Journal, Vol. 33, No. 2, pp. 80-92 (2010).
Jun-Ichi Iwata, Daisuke Takahashi, Atsushi Oshiyama, Taisuke Boku, Kenji Shiraishi, Susumu Okada, and Kazuhiro Yabana: A massively-parallel electronic-structure calculations based on real-space density functional theory, Journal of Computational Physics, Vol. 229, No. 6, pp. 2339-2363 (2010).
Tetsuya Sakurai, Yoshihisa Kodaki, Hiroto Tadano, Daisuke Takahashi, Mitsuhisa Sato, and Umpei Nagashima: A parallel method for large sparse generalized eigenvalue problems using a GridRPC system, Future Generation Computer Systems, Vol. 24, No. 6, pp. 613-619 (2008).
Taisuke Boku, Hajime Susa, Kenji Onuma, Masayuki Umemura, Mitsuhisa Sato, and Daisuke Takahashi: Formation of Dwarf Galaxies in Reionized Universe with Heterogeneous Multicomputer System, International Journal for Multiscale Computational Engineering, Vol. 4, No. 2, pp. 281-289 (2006).
Daisuke Takahashi: An algorithm for multiple-precision floating-point multiplication, Applied Mathematics and Computation, Vol. 166, No. 2, pp. 291-298 (2005).
Daisuke Takahashi: A parallel 1-D FFT algorithm for the Hitachi SR8000, Parallel Computing, Vol. 29, No. 6, pp. 679-690 (2003).
Daisuke Takahashi, Mitsuhisa Sato, and Taisuke Boku: Performance Evaluation of the Hitachi SR8000 Using SPEC OMP2001 Benchmarks, International Journal of Parallel Programming, Vol. 31, No. 3, pp. 185-196 (2003).
Daisuke Takahashi: Efficient implementation of parallel three-dimensional FFT on clusters of PCs, Computer Physics Communications, Vol. 152, No. 2, pp. 144-150 (2003).
Daisuke Takahashi: An Extended Split-Radix FFT Algorithm, IEEE Signal Processing Letters, Vol. 8, No. 5, pp. 145-147 (2001).
Daisuke Takahashi: A fast algorithm for computing large Fibonacci numbers, Information Processing Letters, Vol. 75, No. 6, pp. 243-246 (2000).
Daisuke Takahashi and Yasumasa Kanada: High-Performance Radix-2, 3 and 5 Parallel 1-D Complex FFT Algorithms for Distributed-Memory Parallel Computers, The Journal of Supercomputing, Vol. 15, No. 2, pp. 207-228 (2000).

2. Journal Papers (in Japanese)

佐藤佳州，高橋大介：対局に基づいた教師データの重要度の学習，情報処理学会論文誌, Vol. 55, No. 11, pp. 2399-2409 (2014).
椋木大地，高橋大介：GPUにおける3倍・4倍精度浮動小数点演算の実現と性能評価，情報処理学会論文誌コンピューティングシステム, Vol. 6, No. 1, pp. 66-77 (2013).
佐藤佳州，高橋大介：探索結果を利用した実現確率探索，情報処理学会論文誌, Vol. 51, No. 11, pp. 2021-2030 (2010).
佐藤佳州，高橋大介：モンテカルロ木探索によるコンピュータ将棋，情報処理学会論文誌, Vol. 50, No. 11, pp. 2740-2751 (2009).
高橋睦史，佐藤三久，高橋大介，朴泰祐，宇川彰，中村宏，青木秀貴，澤本英雄，助川直伸：演算加速機構を持つオンチップメモリプロセッサの検討と電力性能評価，情報処理学会論文誌コンピューティングシステム, Vol. 2, No. 1, pp. 158-172 (2009).
横澤拓弥，高橋大介，朴泰祐，佐藤三久：行列積を用いた古典Gram-Schmidt直交化法の並列化，情報処理学会論文誌コンピューティングシステム, Vol. 1, No. 1, pp. 61-72 (2008).
岡本高幸，三浦信一，朴泰祐，佐藤三久，高橋大介：EthernetマルチリンクによるPCクラスタ向け高バンド幅・耐故障ネットワークRI2N/UDP，情報処理学会論文誌：コンピューティングシステム, Vol. 48, No. SIG 8(ACS 18), pp. 153-164 (2007).
木村英明，佐藤三久，堀田義彦，朴泰祐，高橋大介：DVS制御による負荷不均衡のある並列プログラムの電力量削減手法，情報処理学会論文誌：コンピューティングシステム, Vol. 47, No. SIG 12(ACS 15), pp. 285-295 (2006).
堀田義彦，佐藤三久，木村英明，松岡聡，朴泰祐，高橋大介：PCクラスタにおける電力実行プロファイル情報を用いたDVS制御による電力性能の最適化，情報処理学会論文誌：コンピューティングシステム, Vol. 47, No. SIG 12(ACS 15), pp. 272-284 (2006).
三浦信一，岡本高幸，朴泰祐，佐藤三久，高橋大介：VFREC-Net：ドライバ制御によるtagged-VLANを用いたPCクラスタ向けマルチパスネットワーク，情報処理学会論文誌：コンピューティングシステム, Vol. 47, No. SIG 12(ACS 15), pp. 35-45 (2006).
中島佳宏，佐藤三久，相田祥昭，高橋大介，朴泰祐，Franck Cappello：複数グリッドジョブ実行システムの計算資源を統合・利用するGrid RPCシステムの設計と実装，情報処理学会論文誌：コンピューティングシステム, Vol. 47, No. SIG 7(ACS 14), pp. 207-218 (2006).
中島浩，中村宏，佐藤三久，朴泰祐，松岡聡，高橋大介，堀田義彦：高性能計算のための低電力・高密度クラスタ MegaProto，情報処理学会論文誌：コンピューティングシステム, Vol. 46, No. SIG 12(ACS 11), pp. 46-61 (2005).
小島好紀，佐藤三久，朴泰祐，高橋大介：MPIを通信レイヤに用いるソフトウェア分散共有メモリシステム，情報処理学会論文誌：コンピューティングシステム, Vol. 46, No. SIG 7(ACS 10), pp. 63-73 (2005).
櫻井鉄也，多田野寛人，早川賢太郎，佐藤三久，高橋大介，長嶋雲兵，稲富雄一，梅田宏明，渡邊寿雄：大規模固有値問題のmaster-worker型並列解法，情報処理学会論文誌：コンピューティングシステム, Vol. 46, No. SIG 7(ACS 10), pp. 44-51 (2005).
堀田義彦，佐藤三久，朴泰祐，高橋大介，中村宏，中島佳宏，高橋睦史：プロセッサの消費電力測定と低消費電力プロセッサによるクラスタの検討，情報処理学会論文誌：コンピューティングシステム, Vol. 45, No. SIG 11(ACS 7), pp. 207-218 (2004).
高橋大介，朴泰祐，佐藤三久：Short Vector SIMD命令を用いた並列FFTの実現と評価，情報処理学会論文誌：コンピューティングシステム, Vol. 45, No. SIG 11(ACS 7), pp. 50-61 (2004).
中島佳宏，佐藤三久，後藤仁志，朴泰祐，高橋大介：CONFLEX-G：OmniRPCによるグリッド環境上での分子立体配座探索，情報処理学会論文誌：コンピューティングシステム, Vol. 45, No. SIG 6(ACS 6), pp. 254-264 (2004).
大滝雄介，高橋大介，朴泰祐，佐藤三久：ヘテロなクラスタ環境におけるStrassenの行列積アルゴリズムの並列化，情報処理学会論文誌：コンピューティングシステム, Vol. 45, No. SIG 6(ACS 6), pp. 122-133 (2004).
佐藤三久，朴泰祐，高橋大介：OmniRPC：グリッド環境での並列プログラミングのためのGrid RPCシステム，情報処理学会論文誌：コンピューティングシステム, Vol. 44, No. SIG 11(ACS 3), pp. 34-45 (2003).
朴泰祐，佐藤三久，小沼賢治，牧野淳一郎，須佐元，高橋大介，梅村雅之：HMCS-G：グリッド環境における計算宇宙物理のためのハイブリッド計算システム，情報処理学会論文誌：コンピューティングシステム, Vol. 44, No. SIG 11(ACS 3), pp. 1-13 (2003).
高橋睦史，近藤正章，朴泰祐，高橋大介，中村宏，佐藤三久：HPC向けオンチップメモリプロセッサアーキテクチャSCIMAのSMP化の検討と性能評価，情報処理学会論文誌：コンピューティングシステム, Vol. 44, No. SIG 6(ACS 1), pp. 76-86 (2003).
吉川茂洋，朴泰祐，佐藤三久，高橋大介，Carol G. Hoover，William G. Hoover：SMP-PCクラスタにおけるSPAM粒子シミュレーションのハイブリッド並列化，情報処理学会論文誌：ハイパフォーマンスコンピューティングシステム, Vol. 43, No. SIG 6(HPS 5), pp. 143-152 (2002).
高橋大介，朴泰祐，佐藤三久：PCクラスタにおける並列一次元FFTのブロックアルゴリズム，情報処理学会論文誌：ハイパフォーマンスコンピューティングシステム, Vol. 43, No. SIG 6(HPS 5), pp. 134-142 (2002).
高橋大介：共有メモリ型並列計算機における並列FFTのブロックアルゴリズム，情報処理学会論文誌, Vol. 43, No. 4, pp. 995-1004 (2002).
高橋大介，金田康正：積和演算に向いた8基底FFTカーネルの提案，情報処理学会論文誌, Vol. 41, No. 7, pp. 2018-2026 (2000).
高橋大介：Fibonacci数の高速計算法，情報処理学会論文誌, Vol. 41, No. 6, pp. 1918-1921 (2000).
後保範，金田康正，高橋大介：級数に基づく多数桁計算の演算量削減を実現する分割有理数化法，情報処理学会論文誌, Vol. 41, No. 6, pp. 1811-1819 (2000).
高橋大介，金田康正：分散メモリ型並列計算機による円周率の515億桁計算，情報処理学会論文誌, Vol. 39, No. 7, pp. 2074-2083 (1998).
高橋大介，金田康正：分散メモリ型並列計算機による2，3，5基底一次元FFTの実現と評価，情報処理学会論文誌, Vol. 39, No. 3, pp. 519-528 (1998).
高橋大介，金田康正：多数桁の円周率を計算するための公式の改良：ガウス–ルジャンドルの公式とボールウェインの4次の収束の公式，情報処理学会論文誌, Vol. 38, No. 11, pp. 2406-2409 (1997).
高橋大介，鳥居泰伸，湯淺太一：SIMD型超並列計算機における素因数分解，情報処理学会論文誌, Vol. 36, No. 11, pp. 2521-2530 (1995).

3. International Conference Papers (Refereed)

Yukimasa Sugizaki and Daisuke Takahashi: Improved Implementation of Number Theoretic Transform on NVIDIA GPU with Tensor Cores, Proc. Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region (SCA/HPCAsia 2026), pp. 142-152 (2026).
Yukimasa Sugizaki and Daisuke Takahashi: An Improved Implementation of Multi-Threaded Number Theoretic Transform Using Arm Scalable Vector Extension Instruction Set, Proc. 24th International Symposium on Parallel and Distributed Computing (ISPDC 2025), pp. 68-75 (2025).
Tomoya Nagahashi and Daisuke Takahashi: Construction of Large Zero-Aware Pattern Databases for Sliding Puzzles on Distributed Memory Machines, Proc. 25th International Conference on Computational Science and Its Applications (ICCSA 2025), Part I, Lecture Notes in Computer Science, Vol. 15648, pp. 272-284, Springer (2025).
Daisuke Takahashi: Implementation of Multiple Multiplicative Inverses Modulo 2^w Using Intel AVX-512 Instructions, Proc. 25th International Conference on Computational Science and Its Applications (ICCSA 2025), Part III, Lecture Notes in Computer Science, Vol. 15650, pp. 375-384, Springer (2025). (short paper)
Daisuke Takahashi: Parallel Implementation of Number-Theoretic Transform on GPU Clusters, Proc. 24th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2024), Part III, Lecture Notes in Computer Science, Vol. 15253, pp. 204-218, Springer (2025).
Daisuke Takahashi: On the Division in the Computation of Binary BBP-Type Formulas for Mathematical Constants, Proc. 4th International Conference on Numerical Computations: Theory and Algorithms (NUMTA 2023), Part II, Lecture Notes in Computer Science, Vol. 14477, pp. 323-330, Springer (2025). (short paper)
Shota Kawakami and Daisuke Takahashi: Implementation and Evaluation of Octuple-Precision Fast Fourier Transform on GPU, Proc. 2024 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2024), pp. 287-294 (2024).
Toshihiro Hanawa, Kengo Nakajima, Yohei Miki, Takashi Shimokawabe, Kazuya Yamazaki, Shinji Sumimoto, Osamu Tatebe, Taisuke Boku, Daisuke Takahashi, Akira Nukada, Norihisa Fujita, Ryohei Kobayashi, Hiroto Tadano, and Akira Naruse: Preliminary Performance Evaluation of Grace-Hopper GH200, Proc. 2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops 2024), pp. 184-185 (2024). (poster paper)
Daisuke Takahashi: Multiple Integer Divisions with an Invariant Dividend and Monotonically Increasing or Decreasing Divisors, Proc. 23rd International Conference on Computational Science and Its Applications (ICCSA 2023), Part II, Lecture Notes in Computer Science, Vol. 13957, pp. 393-401, Springer (2023). (short paper)
Takuya Edamatsu and Daisuke Takahashi: Efficient Large Integer Multiplication with Arm SVE Instructions, Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2023), pp. 9-17 (2023).
Daisuke Takahashi: An Implementation of Parallel Number-Theoretic Transform Using Intel AVX-512 Instructions, Proc. 24th International Workshop on Computer Algebra in Scientific Computing (CASC 2022), Lecture Notes in Computer Science, Vol. 13366, pp. 318-332, Springer (2022).
Takeyuki Harayama, Shuhei Kudo, Daichi Mukunoki, Toshiyuki Imamura, and Daisuke Takahashi: A Rapid Euclidean Norm Calculation Algorithm that Reduces Overflow and Underflow, Proc. 21st International Conference on Computational Science and Its Applications (ICCSA 2021), Part I, Lecture Notes in Computer Science, Vol. 12949, pp. 95-110, Springer (2021).
Naruya Kitai, Daisuke Takahashi, Franz Franchetti, Takahiro Katagiri, Satoshi Ohshima, and Toru Nagai: An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL, Proc. 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2021), The 16th International Workshop on Automatic Performance Tuning (iWAPT 2021), pp. 789-797 (2021).
Daisuke Takahashi: Fast Multiple Montgomery Multiplications Using Intel AVX-512IFMA Instructions, Proc. 20th International Conference on Computational Science and Its Applications (ICCSA 2020), Part V, Lecture Notes in Computer Science, Vol. 12253, pp. 655-663, Springer (2020). (short paper)
Yukimasa Sugizaki and Daisuke Takahashi: Fast Computation of the Exact Number of Magic Series with an Improved Montgomery Multiplication Algorithm, Proc. 20th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2020), Part II, Lecture Notes in Computer Science, Vol. 12453, pp. 365-382, Springer (2020).
Daisuke Takahashi: Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Intel Xeon Phi Clusters, Proc. 13th International Conference on Parallel Processing and Applied Mathematics (PPAM 2019), Part I, Lecture Notes in Computer Science, Vol. 12043, pp. 151-161, Springer (2020).
Takuya Edamatsu and Daisuke Takahashi: Accelerating Large Integer Multiplication Using Intel AVX-512IFMA, Proc. 19th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2019), Part I, Lecture Notes in Computer Science, Vol. 11944, pp. 60-74, Springer (2020).
Daisuke Takahashi and Franz Franchetti: FFTE on SVE: SPIRAL-Generated Kernels, Proc. International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2020), pp. 114-122 (2020).
Samar Aseeri, Benson K. Muite, and Daisuke Takahashi: Reproducibility in Benchmarking Parallel Fast Fourier Transform based Applications, Companion of the 2019 ACM/SPEC International Conference on Performance Engineering (ICPE'19), pp. 5-8 (2019). (vision paper)
Takuya Edamatsu and Daisuke Takahashi: Acceleration of Large Integer Multiplication with Intel AVX-512 Instructions, Proc. 20th IEEE International Conference on High Performance Computing and Communications (HPCC-2018), pp. 211-218 (2018).
Daisuke Takahashi: An Implementation of Parallel 1-D Real FFT on Intel Xeon Phi Processors, Proc. 17th International Conference on Computational Science and Its Applications (ICCSA 2017), Part I, Lecture Notes in Computer Science, Vol. 10404, pp. 401-410, Springer (2017).
Hiroyuki Takizawa, Daichi Sato, Shoichi Hirasawa, and Daisuke Takahashi: A Customizable Auto-Tuning Scenario with User-defined Code Transformations, Proc. 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2017), The 12th International Workshop on Automatic Performance Tuning (iWAPT 2017), pp. 1372-1378 (2017).
Daichi Mukunoki, Toshiyuki Imamura, and Daisuke Takahashi: Automatic Thread-Block Size Adjustment for Memory-Bound BLAS Kernels on GPUs, Proc. 2016 IEEE 10th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC-16), Special Session: Auto-Tuning for Multicore and GPU (ATMG), pp. 377-384 (2016).
Daisuke Takahashi: Automatic Tuning of Computation-Communication Overlap for Parallel 1-D FFT, Proc. 2016 IEEE 19th International Conference on Computational Science and Engineering (CSE 2016), pp. 253-256 (2016). (short paper)
Daisuke Takahashi: Implementation of Multiple-Precision Floating-Point Arithmetic on Intel Xeon Phi Coprocessors, Proc. 16th International Conference on Computational Science and Its Applications (ICCSA 2016), Part II, Lecture Notes in Computer Science, Vol. 9787, pp. 60-70, Springer (2016).
Hiroshi Maeda and Daisuke Takahashi: Parallel Sparse Matrix-Vector Multiplication Using Accelerators, Proc. 16th International Conference on Computational Science and Its Applications (ICCSA 2016), Part II, Lecture Notes in Computer Science, Vol. 9787, pp. 3-18, Springer (2016).
Hiroshi Maeda and Daisuke Takahashi: Performance Evaluation of Sparse Matrix-Vector Multiplication Using GPU/MIC Cluster, Proc. 2015 Third International Symposium on Computing and Networking (CANDAR'15), 3rd International Workshop on Computer Systems and Architectures (CSA'15), pp. 396-399 (2015). (poster paper)
Daisuke Takahashi: An Implementation of Parallel 1-D FFT Using AVX Instructions on Multi-Core Processors, Proc. 2012 International Workshop on Innovative Architecture for Future Generation Processors and Systems (IWIA 2012), pp. 83-88 (2015).
Daisuke Takahashi: Optimization of All-to-All Communication on Multi-Core Cluster Systems, Proc. 2011 International Workshop on Innovative Architecture for Future Generation Processors and Systems (IWIA 2011), pp. 3-7 (2015).
Daichi Mukunoki, Toshiyuki Imamura, and Daisuke Takahashi: Fast Implementation of General Matrix-Vector Multiplication (GEMV) on Kepler GPUs, Proc. 23rd Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP 2015), pp. 642-650 (2015).
Daichi Mukunoki and Daisuke Takahashi: Using Quadruple Precision Arithmetic to Accelerate Krylov Subspace Methods on GPUs, Proc. 10th International Conference on Parallel Processing and Applied Mathematics (PPAM 2013), Part I, Workshop on Numerical Algorithms on Hybrid Architectures, Lecture Notes in Computer Science, Vol. 8384, pp. 632-642, Springer (2014).
Takaaki Hiragushi and Daisuke Takahashi: Efficient Hybrid Breadth-First Search on GPUs, Proc. 13th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2013), Part II, 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013), Lecture Notes in Computer Science, Vol. 8286, pp. 40-50, Springer (2013).
Daisuke Takahashi: Implementation of Parallel 1-D FFT on GPU Clusters, Proc. 2013 IEEE 16th International Conference on Computational Science and Engineering (CSE 2013), pp. 174-180 (2013).
Yoshikuni Sato, Makoto Miwa, Shogo Takeuchi, and Daisuke Takahashi: Optimizing Objective Function Parameters for Strength in Computer Game-Playing, Proc. 27th AAAI Conference on Artificial Intelligence (AAAI-13), pp. 869-875 (2013).
Daichi Mukunoki and Daisuke Takahashi: Optimization of Sparse Matrix-vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs, Proc. 13th International Conference on Computational Science and Its Applications (ICCSA 2013), Part V, Lecture Notes in Computer Science, Vol. 7975, pp. 211-223, Springer (2013).
Hiroki Yoshizawa and Daisuke Takahashi: Automatic Tuning of Sparse Matrix-Vector Multiplication for CRS format on GPUs, Proc. 2012 IEEE 15th International Conference on Computational Science and Engineering (CSE 2012), pp. 130-136 (2012).
Daisuke Takahashi: An Implementation of Parallel 2-D FFT Using Intel AVX Instructions on Multi-Core Processors, Proc. 12th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2012), Part II, Lecture Notes in Computer Science, Vol. 7440, pp. 197-205, Springer (2012). (short paper)
Daisuke Takahashi, Atsuya Uno, and Mitsuo Yokokawa: An Implementation of Parallel 1-D FFT on the K computer, Proc. 2012 IEEE 14th International Conference on High Performance Computing and Communications (HPCC-2012), pp. 344-350 (2012).
T. Boku, K.-I. Ishikawa, Y. Kuramashi, K. Minami, Y. Nakamura, F. Shoji, D. Takahashi, M. Terai, A. Ukawa, and T. Yoshie: Multi-block/multi-core SSOR preconditioner for the QCD quark solver for K computer, Proceedings of Science, The 30th International Symposium on Lattice Field Theory (Lattice 2012), p. 188 (2012).
Yohei Miki, Daisuke Takahashi, and Masao Mori: A Fast Implementation and Performance Analysis of Collisionless N-body Code Based on GPGPU, Proc. International Conference on Computational Science (ICCS 2012), Procedia Computer Science, Vol. 9, pp. 96-105, Elsevier (2012).
Takuma Nomizu, Daisuke Takahashi, Jinpil Lee, Taisuke Boku, and Mitsuhisa Sato: Implementation of XcalableMP Device Acceleration Extention with OpenCL, Proc. 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW 2012), Multicore and GPU Programming Models, Languages and Compilers Workshop (PLC 2012), pp. 2394-2403 (2012).
Daichi Mukunoki and Daisuke Takahashi: Implementation and Evaluation of Triple Precision BLAS Subroutines on GPUs, Proc. 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW 2012), The 13th Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-12), pp. 1378-1386 (2012).
Daichi Mukunoki and Daisuke Takahashi: Implementation and Evaluation of Quadruple Precision BLAS Functions on GPUs, Proc. 10th International Conference on Applied Parallel and Scientific Computing (PARA 2010), Part I, Lecture Notes in Computer Science, Vol. 7133, pp. 249-259, Springer (2012).
Takatoshi Nakayama and Daisuke Takahashi: Implementation of Multiple-Precision Floating-Point Arithmetic Library for GPU Computing, Proc. 23rd IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2011), pp. 343-349 (2011).
Yukihiro Hasegawa, Jun-Ichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Fumiyoshi Shoji, Atsuya Uno, Motoyoshi Kurokawa, Hikaru Inoue, Ikuo Miyoshi, and Mitsuo Yokokawa: First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer, Proc. 2011 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11) (2011).
Yuji Kubota and Daisuke Takahashi: Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPU, Proc. 11th International Conference on Computational Science and Its Applications (ICCSA 2011), Part II, Lecture Notes in Computer Science, Vol. 6783, pp. 547-561, Springer (2011).
Daisuke Takahashi: An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors, Proc. 8th International Conference on Parallel Processing and Applied Mathematics (PPAM 2009), Part I, Workshop on Memory Issues on Multi- and Manycore Platforms, Lecture Notes in Computer Science, Vol. 6067, pp. 606-614, Springer (2010).
Chikafumi Takahashi, Mitsuhisa Sato, Daisuke Takahashi, Taisuke Boku, Akira Ukawa, Hiroshi Nakamura, Hidetaka Aoki, Hideo Sawamoto, and Naonobu Sukegawa: Design and Power Performance Evaluation of On-Chip Memory Processor with Arithmetic Accelerators, Proc. 2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2008), pp. 51-57 (2009).
Daisuke Takahashi: A Parallel Algorithm for Multiple-Precision Division by a Single-Precision Integer, Proc. 6th International Conference on Large-Scale Scientific Computations (LSSC 2007), Lecture Notes in Computer Science, Vol. 4818, pp. 729-736, Springer (2008).
Chikafumi Takahashi, Mitsuhisa Sato, Daisuke Takahashi, Taisuke Boku, Hiroshi Nakamura, Masaaki Kondo, and Motonobu Fujita: Empirical Study for Optimization of Power-Performance with On-Chip Memory, Proc. First International Workshop on Advanced Low Power Systems (ALPS 2006), Lecture Notes in Computer Science, Vol. 4759, pp. 466-479, Springer (2008).
Daisuke Takahashi: Implementation and Evaluation of Parallel FFT Using SIMD Instructions on Multi-Core Processors, Proc. 2007 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2007), pp. 53-59 (2008).
Daisuke Takahashi: An Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors, Proc. 8th International Workshop on State of the Art in Scientific Computing (PARA 2006), Lecture Notes in Computer Science, Vol. 4699, pp. 1178-1187, Springer (2007).
Akira Nukada, Daisuke Takahashi, Reiji Suda, and Akira Nishida: High Performance FFT on SGI Altix 3700, Proc. 3rd International Conference on High Performance Computing and Communications (HPCC 2007), Lecture Notes in Computer Science, Vol. 4782, pp. 396-407, Springer (2007).
Takayuki Imada, Mitsuhisa Sato, Yoshihiko Hotta, Hideaki Kimura, Taisuke Boku, Daisuke Takahashi, Shinichi Miura, and Hiroshi Nakashima: Power-performance Evaluation on Ultra-Low Power High-performance Cluster System: MegaProto/E, Proc. IEEE Symposium on Low-Power and High-Speed Chips (COOL Chips X), pp. 117-129 (2007).
Takayuki Okamoto, Shinichi Miura, Taisuke Boku, Mitsuhisa Sato, and Daisuke Takahashi: RI2N/UDP: High bandwidth and fault-tolerant network for PC-cluster based on multi-link Ethernet, Proc. 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), The Workshop on Communication Architecture for Clusters (CAC 2007) (2007).
Hideaki Kimura, Mitsuhisa Sato, Yoshihiko Hotta, Taisuke Boku, and Daisuke Takahashi: Empirical Study on Reducing Energy of Parallel Programs using Slack Reclamation by DVFS, Proc. 2006 IEEE International Conference on Cluster Computing (Cluster 2006), pp. 1-10 (2006).
Taisuke Boku, Mitsuhisa Sato, Akira Ukawa, Daisuke Takahashi, Shinji Sumimoto, Kouichi Kumon, Takashi Moriyama, and Masaaki Shimizu: PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computations, Proc. Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06), pp. 233-240 (2006).
Daisuke Takahashi: A Hybrid MPI/OpenMP Implementation of a Parallel 3-D FFT on SMP Clusters, Proc. 6th International Conference on Parallel Processing and Applied Mathematics (PPAM 2005), Lecture Notes in Computer Science, Vol. 3911, pp. 970-977, Springer (2006).
Yoshiaki Aida, Yoshihiro Nakajima, Mitsuhisa Sato, Tetsuya Sakurai, Daisuke Takahashi, and Taisuke Boku: Performance Improvement by Data Management Layer in a Grid RPC System, Proc. First International Conference on Grid and Pervasive Computing (GPC 2006), Lecture Notes in Computer Science, Vol. 3947, pp. 324-335, Springer (2006).
Taisuke Boku, Mitsuhisa Sato, Daisuke Takahashi, Hiroshi Nakashima, Hiroshi Nakamura, Satoshi Matsuoka, and Yoshihiko Hotta: MegaProto/E: Power-Aware High-Performance Cluster with Commodity Technology, Proc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), The Second Workshop on High-Performance, Power-Aware Computing (HP-PAC 2006) (2006).
Yoshihiko Hotta, Mitsuhisa Sato, Hideaki Kimura, Satoshi Matsuoka, Taisuke Boku, and Daisuke Takahashi: Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC cluster, Proc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), The Second Workshop on High-Performance, Power-Aware Computing (HP-PAC 2006) (2006).
Daisuke Takahashi, Taisuke Boku, and Mitsuhisa Sato: An Implementation of Parallel 3-D FFT Using Short Vector SIMD Instructions on Clusters of PCs, Proc. 7th International Workshop on Applied Parallel Computing (PARA 2004), Lecture Notes in Computer Science, Vol. 3732, pp. 1159-1167, Springer (2006).
Tetsuya Sakurai, Kentaro Hayakawa, Mitsuhisa Sato, and Daisuke Takahashi: A Parallel Method for Large Sparse Generalized Eigenvalue Problems by OmniRPC in a Grid Environment, Proc. 7th International Workshop on Applied Parallel Computing (PARA 2004), Lecture Notes in Computer Science, Vol. 3732, pp. 1151-1158, Springer (2006).
Daisuke Takahashi, Mitsuhisa Sato, and Taisuke Boku: Computation of High-Precision Mathematical Constants in a Combined Cluster and Grid Environment, Proc. 5th International Conference on Large-Scale Scientific Computations (LSSC 2005), Lecture Notes in Computer Science, Vol. 3743, pp. 454-461, Springer (2006).
Yoshinori Ojima, Mitsuhisa Sato, Taisuke Boku, and Daisuke Takahashi: Design of a Software Distributed Shared Memory System using an MPI communication layer, Proc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005), pp. 220-229 (2005).
Shinichi Miura, Takayuki Okamoto, Taisuke Boku, Mitsuhisa Sato, and Daisuke Takahashi: Low-cost High-bandwidth Tree Network for PC Clusters based on Tagged-VLAN Technology, Proc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005), pp. 84-93 (2005).
Hiroshi Nakashima, Hiroshi Nakamura, Mitsuhisa Sato, Taisuke Boku, Satoshi Matsuoka, Daisuke Takahashi, and Yoshihiko Hotta: MegaProto: 1TFlops/10kW Rack Is Feasible Even with Only Commodity Technology, Proc. 2005 ACM/IEEE Conference on Supercomputing (SC|05) (2005).
Hiroshi Nakashima, Hiroshi Nakamura, Mitsuhisa Sato, Taisuke Boku, Satoshi Matsuoka, Daisuke Takahashi, and Yoshihiko Hotta: MegaProto: A Low-Power and Compact Cluster for High-Performance Computing, Proc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Workshop on High Performance, Power-Aware Computing (HPPAC) (2005).
Taisuke Boku, Kenji Onuma, Mitsuhisa Sato, Yoshihiro Nakajima, and Daisuke Takahashi: Grid environment for computational astrophysics driven by GRAPE-6 with HMCS-G and OmniRPC, Proc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05), Joint Workshop on High-Performance Grid Computing & High-Level Parallel Programming Models (HIPS-HPGC) (2005).
Yoshinori Ojima, Mitsuhisa Sato, Taisuke Boku, and Daisuke Takahashi: Design of Software Distributed Shared Memory System using MPI communication layer, Proc. 4th International Workshop on OpenMP: Experiences and Implementations (WOMPEI 2005), pp. 18-25 (2005).
Taisuke Boku, Mitsuhisa Sato, Masazumi Matsubara, and Daisuke Takahashi: OpenMPI — OpenMP like tool for easy programming in MPI, Proc. 6th European Workshop on OpenMP (EWOMP 2004), pp. 83-88 (2004).
Yoshihiro Nakajima, Mitsuhisa Sato, Hitoshi Goto, Taisuke Boku, and Daisuke Takahashi: Implementation and Performance Evaluation of CONFLEX-G: Grid-enabled Molecular Conformational Space Search Program with OmniRPC, Proc. 18th International Conference on Supercomputing (ICS'04), pp. 154-163 (2004).
Chikafumi Takahashi, Masaaki Kondo, Taisuke Boku, Daisuke Takahashi, Hiroshi Nakamura, and Mitsuhisa Sato: SCIMA-SMP: on-chip memory processor architecture for SMP, Proc. 3rd Workshop on Memory Performance Issues (WMPI'04), pp. 121-128 (2004).
Taisuke Boku, Hajime Susa, Kenji Onuma, Masayuki Umemura, Mitsuhisa Sato, and Daisuke Takahashi: Formation of Dwarf Galaxies in Reionized Universe with Heterogeneous Multi-Computer System, Proc. International Conference on Computational Science 2004 (ICCS 2004), Part IV, Workshop on Modeling and Simulation of Multi-physics Multi-scale Systems, Lecture Notes in Computer Science, Vol. 3039, pp. 629-636, Springer (2004).
Yuhsuke Ohtaki, Daisuke Takahashi, Taisuke Boku, and Mitsuhisa Sato: Parallel Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous Clusters, Proc. 18th International Parallel and Distributed Processing Symposium (IPDPS'04), The 13th Heterogeneous Computing Workshop (HCW 2004) (2004).
Yoshihiko Hotta, Mitsuhisa Sato, Taisuke Boku, Daisuke Takahashi, and Chikafumi Takahashi: Measurement and Characterization of Power Consumption of Microprocessors for Power-aware Cluster, Proc. An International Symposium on Low-Power and High-Speed Chips (COOL Chips VII), pp. 293-303 (2004).
Yoshihiro Nakajima, Mitsuhisa Sato, Taisuke Boku, Daisuke Takahashi, and Hitoshi Gotoh: Performance Evaluation of OmniRPC in a Grid Environment, Proc. 2004 International Symposium on Applications and the Internet Workshops (SAINT 2004 Workshops), pp. 658-664 (2004).
Kenji Onuma, Taisuke Boku, Mitsuhisa Sato, Daisuke Takahashi, Hajime Susa, and Masayuki Umemura: Heterogeneous Remote Computing System for Computational Astrophysics with OmniRPC, Proc. 2004 International Symposium on Applications and the Internet Workshops (SAINT 2004 Workshops), pp. 623-629 (2004).
Shinichi Miura, Taisuke Boku, Mitsuhisa Sato, and Daisuke Takahashi: RI2N — Interconnection Network System for Clusters with Wide-Bandwidth and Fault-Tolerancy Based on Multiple Links, Proc. 5th International Symposium on High Performance Computing (ISHPC 2003), Lecture Notes in Computer Science, Vol. 2858, pp. 342-351, Springer (2003).
Daisuke Takahashi: A Radix-16 FFT Algorithm Suitable for Multiply-Add Instruction Based on Goedecker Method, Proc. 2003 IEEE International Conference on Multimedia and Expo (ICME 2003), Vol. 2, pp. 845-848 (2003). (poster paper)
Daisuke Takahashi, Mitsuhisa Sato, and Taisuke Boku: An OpenMP Implementation of Parallel FFT and Its Performance on IA-64 Processors, Proc. International Workshop on OpenMP Applications and Tools (WOMPAT 2003), Lecture Notes in Computer Science, Vol. 2716, pp. 99-108, Springer (2003).
Taisuke Boku, Mitsuhisa Sato, Kenji Onuma, Junichiro Makino, Hajime Susa, Daisuke Takahashi, Masayuki Umemura, and Akira Ukawa: HMCS-G: Grid-enabled Hybrid Computing System for Computational Astrophysics, Proc. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'03), Workshop on Grids and Advanced Networks (GAN'03), pp. 558-565 (2003).
Mitsuhisa Sato, Taisuke Boku, and Daisuke Takahashi: OmniRPC: a Grid RPC System for Parallel Programming in Cluster and Grid Environment, Proc. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'03), pp. 206-213 (2003).
Daisuke Takahashi: A Radix-16 FFT Algorithm Suitable for Multiply-Add Instruction Based on Goedecker Method, Proc. 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Vol. 2, pp. 665-668 (2003). (poster paper)
Shinsuke Nara, Yuichi Goto, Daisuke Takahashi, and Jingde Cheng: Parallel Forward Deduction System for General-Purpose Entailment Calculus on Clusters of PCs, Proc. IASTED International Conference on Networks, Parallel and Distributed Processing, and Applications (NPDPA 2002), pp. 359-364 (2002).
Yuichi Goto, Daisuke Takahashi, and Jingde Cheng: Improving Performance of Automated Forward Deduction System EnCal on Shared-Memory Parallel Computers, Proc. Third International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2002), pp. 63-68 (2002).
Daisuke Takahashi, Taisuke Boku, and Mitsuhisa Sato: A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs, Proc. 8th International Euro-Par Conference (Euro-Par 2002), Lecture Notes in Computer Science, Vol. 2400, pp. 691-700, Springer (2002).
Daisuke Takahashi: A Blocking Algorithm for Parallel 1-D FFT on Shared-Memory Parallel Computers, Proc. 6th International Conference on Applied Parallel Computing (PARA 2002), Lecture Notes in Computer Science, Vol. 2367, pp. 380-389, Springer (2002).
Daisuke Takahashi, Mitsuhisa Sato, and Taisuke Boku: Performance Evaluation of the Hitachi SR8000 Using OpenMP Benchmarks, Proc. 4th International Symposium on High Performance Computing (ISHPC 2002), Lecture Notes in Computer Science, Vol. 2327, pp. 390-400, Springer (2002).
Yuichi Goto, Daisuke Takahashi, and Jingde Cheng: Parallel Forward Deduction Algorithms of General-Purpose Entailment Calculus on Shared-Memory Parallel Computers, Proc. 2nd International Conference on Software Engineering, Artificial Intelligence, Networking & Parallel/Distributed Computing (SNPD'01), pp. 168-175 (2001).
Daisuke Takahashi: A Blocking Algorithm for FFT on Cache-Based Processors, Proc. 9th International Conference on High Performance Computing and Networking Europe (HPCN Europe 2001), Lecture Notes in Computer Science, Vol. 2110, pp. 551-554, Springer (2001). (poster paper)
Daisuke Takahashi: A Mixed-Radix Parallel Three-Dimensional FFT Algorithm on Clusters of Vector SMPs, Proc. Tenth SIAM Conference on Parallel Processing for Scientific Computing (PP01) (2001).
Seiji Nishimura, Daisuke Takahashi, Takaomi Shigehara, Hiroshi Mizoguchi, and Taketoshi Mishima: A Performance Study on a Single Processing Node of the HITACHI SR8000, Proc. Second International Conference on Numerical Analysis and Its Applications (NAA 2000), Lecture Notes in Computer Science, Vol. 1988, pp. 628-635, Springer (2001).
Daisuke Takahashi: A Parallel 3-D FFT Algorithm on Clusters of Vector SMPs, Proc. 5th International Workshop on Applied Parallel Computing (PARA 2000), Lecture Notes in Computer Science, Vol. 1947, pp. 316-323, Springer (2001).
Daisuke Takahashi: Implementation of Multiple-Precision Parallel Division and Square Root on Distributed-Memory Parallel Computers, Proc. 2000 International Workshop on Parallel Processing (ICPP'00 Workshops), Workshop on High Performance Scientific and Engineering Computing with Applications (HPSECA-00), pp. 229-235 (2000).
Seiji Nishimura, Daisuke Takahashi, Takaomi Shigehara, Hiroshi Mizoguchi, and Taketoshi Mishima: Efficient Implementation of CG & CR Methods for Linear Systems on a Single Processing Node of HITACHI SR8000, Proc. 2000 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC2000), pp. 298-301 (2000).
Daisuke Takahashi: A New Radix-6 FFT Algorithm Suitable for Multiply-Add Instruction, Proc. 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Vol. 6, pp. 3343-3346 (2000). (poster paper)
Daisuke Takahashi: High-Performance Parallel FFT Algorithms for the HITACHI SR8000, Proc. Fourth International Conference/Exhibition on High Performance Computing in Asia-Pacific Region (HPC-Asia 2000), Vol. 1, pp. 192-199 (2000).
Daisuke Takahashi and Yasumasa Kanada: Fast High-Precision Arithmetic on Distributed Memory Parallel Machines, Proc. Ninth SIAM Conference on Parallel Processing for Scientific Computing (PP99) (1999).

4. 国内学会論文（査読あり）

佐藤佳州，高橋大介：大規模な対局に基づいた教師データの重要度の学習，第17回ゲームプログラミングワークショップ, pp. 22-29 (2012).
佐藤佳州，高橋大介：特徴の生成を組み合わせた機械学習，第16回ゲームプログラミングワークショップ, pp. 135-142 (2011).
椋木大地，高橋大介：GPUによる4倍・8倍精度BLASの実装と評価，2011年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2011論文集, pp. 148-156 (2011).
佐藤佳州，高橋大介：探索結果を利用した実現確率探索，第14回ゲームプログラミングワークショップ, pp. 148-155 (2009).
佐藤佳州，高橋大介：モンテカルロ木探索によるコンピュータ将棋，第13回ゲームプログラミングワークショップ, pp. 1-8 (2008).
高橋睦史，佐藤三久，高橋大介，朴泰祐，宇川彰，中村宏，青木秀貴，澤本英雄，助川直伸：演算加速機構を持つオンチップメモリプロセッサの検討と電力性能評価，2008年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2008論文集, pp. 33-40 (2008).
横澤拓弥，高橋大介，朴泰祐，佐藤三久：行列積を用いた古典Gram-Schmidt直交化法の並列化，2008年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2008論文集, pp. 1-8 (2008).
岡本高幸，三浦信一，朴泰祐，佐藤三久，高橋大介：EthernetマルチリンクによるPCクラスタ向け高バンド幅・耐故障ネットワークRI2N/UDP，2007年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2007論文集, pp. 41-48 (2007).
木村英明，佐藤三久，堀田義彦，朴泰祐，高橋大介：DVS制御による負荷不均衡のある並列プログラムの電力量削減手法，先進的計算基盤システムシンポジウムSACSIS2006論文集, pp. 477-486 (2006).
三浦信一，岡本高幸，朴泰祐，佐藤三久，高橋大介：VFREC-Net: ドライバ制御によるtagged-VLANを用いたPCクラスタ向けマルチパスネットワーク，先進的計算基盤システムシンポジウムSACSIS2006論文集, pp. 117-125 (2006).
相田祥昭，中島佳宏，佐藤三久，櫻井鉄也，高橋大介，朴泰祐：Grid RPCにおける広域データ管理レイヤの利用，先進的計算基盤システムシンポジウムSACSIS2006論文集, pp. 85-92 (2006).
中島佳宏，佐藤三久，相田祥昭，高橋大介，朴泰祐，Franck Cappello：複数グリッドジョブ実行システムの計算資源を統合・利用するGrid RPCシステムの設計と実装，2006年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2006論文集, pp. 17-24 (2006).
小島好紀，佐藤三久，朴泰祐，高橋大介：MPIを通信レイヤに用いるソフトウェア分散共有メモリシステム，2005年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2005論文集, pp. 89-96 (2005).
櫻井鉄也，多田野寛人，早川賢太郎，佐藤三久，高橋大介，長嶋雲兵，稲富雄一，梅田宏明，渡邊寿雄：大規模固有値問題のmaster-worker型並列解法，2005年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2005論文集, pp. 43-50 (2005).
高橋大介，朴泰祐，佐藤三久：Short Vector SIMD命令を用いた並列FFTの実現と評価，先進的計算基盤システムシンポジウムSACSIS2004論文集, pp. 277-286 (2004).
堀田義彦，佐藤三久，朴泰祐，高橋大介，中村宏，中島佳宏，高橋睦史：プロセッサの消費電力測定と低消費電力プロセッサによるクラスタの検討，先進的計算基盤システムシンポジウムSACSIS2004論文集, pp. 19-26 (2004).
大滝雄介，高橋大介，朴泰祐，佐藤三久：ヘテロなクラスタ環境におけるStrassenの行列積アルゴリズムの並列化，2004年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2004論文集, pp. 141-148 (2004).
中島佳宏，佐藤三久，後藤仁志，朴泰祐，高橋大介：CONFLEX-G: OmniRPCによるグリッド環境上での分子立体配座探索，2004年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2004論文集, pp. 95-102 (2004).
朴泰祐，佐藤三久，小沼賢治，牧野淳一郎，須佐元，高橋大介，梅村雅之：HMCS-G: グリッド環境における計算宇宙物理のためのハイブリッド計算システム，先進的計算基盤システムシンポジウムSACSIS2003論文集, pp. 235-242 (2003).
佐藤三久，朴泰祐，高橋大介：OmniRPC: グリッド環境での並列プログラミングのためのGrid RPCシステム，先進的計算基盤システムシンポジウムSACSIS2003論文集, pp. 105-112 (2003).
高橋睦史，近藤正章，朴泰祐，高橋大介，中村宏，佐藤三久：HPC向けオンチップメモリプロセッサアーキテクチャSCIMAのSMP化の検討と性能評価，2003年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2003論文集, pp. 47-54 (2003).
吉川茂洋，朴泰祐，佐藤三久，高橋大介，Hoover, C. G.，Hoover, W. G.：SMP-PCクラスタにおけるSPAM粒子シミュレーションのハイブリッド並列化，並列処理シンポジウムJSPP2002論文集, pp. 63-70 (2002).
高橋大介，朴泰祐，佐藤三久：PCクラスタにおける並列一次元FFTのブロックアルゴリズム，並列処理シンポジウムJSPP2002論文集, pp. 55-62 (2002).
高橋大介：PCクラスタにおける並列三次元FFTのブロックアルゴリズム，2002年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2002論文集, pp. 59-64 (2002).
高橋大介：共有メモリ型並列計算機における並列一次元FFTのブロックアルゴリズム，並列処理シンポジウムJSPP2001論文集, pp. 359-366 (2001).
高橋大介：分散メモリ型並列計算機HITACHI SR8000における並列FFTアルゴリズム，並列処理シンポジウムJSPP2000論文集, pp. 91-98 (2000).
高橋大介，金田康正：分散メモリ型並列計算機による2, 3, 5基底一次元FFTの実現と評価，並列処理シンポジウムJSPP'97論文集, pp. 369-376 (1997).

5. 国際会議論文（査読なし），国内学会論文（査読なし）

杉﨑行優，高橋大介：尾崎スキームIIを利用した剰余整数行列乗算の高速化，情報処理学会研究報告，Vol. 2026-HPC-204，No. 5 (2026).
高橋大介：Intel AVX-512命令を用いた複数のモジュラ逆数計算の高速化，日本応用数理学会2025年度年会講演予稿集 (2025).
高橋大介：GPUクラスタにおける並列数論変換の自動チューニング，日本応用数理学会2025年度年会講演予稿集 (2025).
川上昌汰，高橋大介：より低精度なFFTを用いた高精度なFFTの計算手法の提案，情報処理学会研究報告，Vol. 2025-HPC-200，No. 25 (2025).
高橋大介：数学定数に対する2進BBP型公式の計算における除算について，日本応用数理学会2024年度年会講演予稿集 (2024).
高橋大介：GPUクラスタにおける並列数論変換の実現と評価，日本応用数理学会2024年度年会講演予稿集 (2024).
長橋朋也，高橋大介：MPI/OpenMP並列化によるスライドパズルのZero-Aware Pattern Databaseの構築，情報処理学会研究報告，Vol. 2024-HPC-195，No. 23 (2024).
塙敏博，建部修見，中島研吾，朴泰祐，三木洋平，下川辺隆史，山崎一哉，住元真司，高橋大介，額田彰，藤田典久，小林諒平，多田野寛人，田浦健次朗，細川颯介，髙橋淳一郎，成瀬彰：GH200の予備性能評価，情報処理学会研究報告，Vol. 2024-HPC-195，No. 4 (2024).
川上昌汰，高橋大介：GPUにおける8倍精度高速フーリエ変換の実装と評価，情報処理学会研究報告，Vol. 2024-HPC-194，No. 2 (2024).
山口博將，高橋大介：ルジャンドル予想の数値的検証，情報処理学会第85回全国大会講演論文集，5J-05 (2023).
高橋大介：数学定数に対する2進BBP型公式の計算におけるMontgomery乗算の使用について，日本応用数理学会2022年度年会講演予稿集 (2022).
高橋大介：Intel AVX-512IFMA命令を用いた並列数論変換の実現と評価，日本応用数理学会2022年度年会講演予稿集 (2022).
高橋大介：二次元分割を用いた並列三次元FFTにおける計算と通信のオーバーラップの自動チューニング，日本応用数理学会2021年度年会講演予稿集 (2021).
Naruya Kitai, Daisuke Takahashi, Franz Franchetti, Takahiro Katagiri, Satoshi Ohshima, and Toru Nagai: Adaptation of A64 Scalable Vector Extension for Spiral, 情報処理学会研究報告，Vol. 2021-HPC-178，No. 9 (2021).
原山赳幸，工藤周平，椋木大地，今村俊幸，高橋大介：オーバー・アンダーフローを抑えた高精度かつ高速な2ノルム計算手法，情報処理学会研究報告，Vol. 2020-HPC-177，No. 8 (2020).
杉﨑行優，高橋大介：NVIDIA Volta GPUにおける浮動小数点演算を用いた剰余乗算の高速化，日本応用数理学会2020年度年会講演予稿集 (2020).
高橋大介：Intel AVX-512IFMA命令を用いた複数のMontgomery乗算の高速化，日本応用数理学会2020年度年会講演予稿集 (2020).
枝松拓弥，高橋大介：AVX-512IFMAを用いた多倍長整数乗算の高速化，日本応用数理学会2019年度年会講演予稿集 (2019).
高橋大介：Xeon Phiクラスタにおける二次元分割を用いた並列三次元実数FFTの実現と評価，日本応用数理学会2019年度年会講演予稿集 (2019).
佐藤駿一，高橋大介：GPUにおけるSELL形式疎行列ベクトル積の性能評価，日本応用数理学会2018年度年会講演予稿集 (2018).
高橋大介：Intel AVX-512命令を用いた複数の整数除算の高速化，日本応用数理学会2018年度年会講演予稿集 (2018).
佐藤駿一，高橋大介：GPUにおけるSELL形式疎行列ベクトル積の実装と性能評価，情報処理学会研究報告，Vol. 2018-HPC-164，No. 3 (2018).
高橋大介：数学定数の特定の桁を計算するBBP型公式の高速計算法，日本応用数理学会2017年度年会講演予稿集, pp. 249-250 (2017).
高橋大介：Xeon Phiプロセッサにおける並列一次元実数FFTの実現と評価，日本応用数理学会2017年度年会講演予稿集, pp. 149-150 (2017).
高橋大介：Knights Landingクラスタにおける並列FFTの自動チューニング，2017年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2017論文集, pp. 1-2 (2017).
高橋大介：Xeon Phiクラスタ上の並列FFTにおける通信隠蔽の自動チューニング，計算工学講演会論文集，Vol. 22，C-01-3 (2017).
高橋大介：SIMD命令を用いた整数除算の高速化，日本応用数理学会2016年度年会講演予稿集 (2016).
高橋大介：並列FFTにおける通信隠蔽の自動チューニング，日本応用数理学会2016年度年会講演予稿集 (2016).
五味歩武，高橋大介：最適化手法を自動化するXevolverフレームワーク用定義ファイルの実装，情報処理学会研究報告，Vol. 2016-HPC-155，No. 7 (2016).
高橋大介：FFTにおけるAT，2016年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2016論文集, pp. 47-48 (2016).
高橋大介：並列FFTにおける通信隠蔽の自動チューニング，計算工学講演会論文集，Vol. 21，F-2-1 (2016).
篠塚敬介，高橋大介：中心-半径型区間演算に基づく矩形演算を用いた精度保証付き高速フーリエ変換，情報処理学会研究報告，Vol. 2016-HPC-154，No. 9 (2016).
高橋大介：Xeon Phiにおける並列FFTの実現と評価，日本応用数理学会2015年度年会講演予稿集 (2015).
高橋大介：Xeon Phiにおける多倍長精度浮動小数点演算の実現と評価，日本応用数理学会2015年度年会講演予稿集 (2015).
椋木大地，今村俊幸，高橋大介：NVIDIA GPUにおけるメモリ律速なBLASカーネルのスレッド数自動選択手法，情報処理学会研究報告，Vol. 2015-HPC-150，No. 13 (2015).
高橋大介：Xeon Phiクラスタにおける並列FFTの自動チューニング，計算工学講演会論文集，Vol. 20，E-2-2 (2015).
椋木大地，今村俊幸，高橋大介：NVIDIA GPUにおけるGEMVカーネルの自動チューニング，計算工学講演会論文集，Vol. 20，E-2-1 (2015).
椋木大地，今村俊幸，高橋大介：Kepler・MaxwellアーキテクチャGPUにおける性能が行列形状に依存しない高速なGEMVの実装，Annual Meeting on Advanced Computing System and Infrastructure (ACSI) 2015論文集 (2015).
高橋大介：GPUクラスタにおける並列FFTの自動チューニング，日本応用数理学会2014年度年会講演予稿集 (2014).
高橋大介：GPUクラスタにおける並列FFTの自動チューニング，計算工学講演会論文集，Vol. 19，F-7-3 (2014).
前田広志，高橋大介：GPU/MICクラスタにおける疎行列ベクトル積の性能評価，情報処理学会研究報告，Vol. 2014-HPC-144，No. 4 (2014).
Satoshi Matsuoka, William Kramer, and Daisuke Takahashi: The HPC Decathlon Assessment Measure: A Proposal to Define a New Composite Benchmark for High Performance Computing, Storage, Networking and Analysis, Proc. Workshop on Modeling & Simulation of Exascale Systems and Applications (MODSIM 2013) (2013). (position paper)
椋木大地，高橋大介：GPUにおける4倍精度浮動小数点演算を用いたクリロフ部分空間法の高速化，情報処理学会研究報告，Vol. 2013-HPC-140，No. 35 (2013).
平櫛貴章，高橋大介：GPUクラスタにおける幅優先探索の高速化，情報処理学会研究報告，Vol. 2013-HPC-139，No. 12 (2013).
椋木大地，高橋大介：GPUにおける高速なCRS形式疎行列ベクトル積の実装，情報処理学会研究報告，Vol. 2013-HPC-138，No. 5 (2013).
Hiroyuki Takizawa, Ryusuke Egawa, Daisuke Takahashi, and Reiji Suda: HPC Refactoring with Hierarchical Abstractions to Help Software Evolution, Sustained Simulation Performance 2012: Proceedings of the joint Workshop on High Performance Computing on Vector Systems, Stuttgart (HLRS), and Workshop on Sustained Simulation Performance, Tohoku University, 2012, pp. 27-33, Springer (2013).
椋木大地，高橋大介：GPUにおける4倍精度演算を用いた疎行列反復解法の実装と評価，情報処理学会研究報告，Vol. 2012-ARC-202，2012-HPC-137，No. 37 (2012).
三木洋平，高橋大介，森正夫：大規模GPUクラスタにおけるN体計算コードの演算性能とスケーラビリティの評価，情報処理学会研究報告，Vol. 2012-HPC-136，No. 1 (2012).
高橋大介：ポストペタスケール計算環境に向けた並列FFTの自動チューニング，日本応用数理学会2012年度年会講演予稿集，pp. 285-286 (2012).
吉澤大樹，高橋大介：GPUにおけるCRS形式疎行列ベクトル積の自動チューニング，情報処理学会研究報告，Vol. 2012-HPC-135，No. 31 (2012).
高橋大介：並列FFTにおける自動チューニング，計算工学講演会論文集，Vol. 17，E-7-2 (2012).
Daichi Mukunoki and Daisuke Takahashi: Performance Comparison of Double, Triple and Quadruple Precision Real and Complex BLAS Subroutines on GPUs, Proc. ATIP/A*CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way? (ATIP/A*CRC Workshop '12), pp. 788-790 (2012).
野水拓馬，高橋大介，李珍泌，朴泰祐，佐藤三久：並列言語XcalableMPのアクセラレータ向け言語拡張のOpenCL実装，情報処理学会研究報告，Vol. 2012-HPC-133，No. 9 (2012).
長谷川幸弘，岩田潤一，辻美和子，高橋大介，南一生：京速コンピュータ「京」におけるペタフロップス・アプリケーションRSDFT，2012年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2012論文集, p. 50 (2012).
中山空星，高橋大介：GPU上における多倍長精度浮動小数点演算の実装，情報処理学会研究報告，Vol. 2011-ARC-197，2011-HPC-132，No. 25 (2011).
椋木大地，高橋大介：GPUによる3倍精度浮動小数点演算の検討，情報処理学会研究報告，Vol. 2011-ARC-197，2011-HPC-132，No. 23 (2011).
朴泰祐，佐藤三久，塙敏博，児玉祐悦，高橋大介，建部修見，多田野寛人，藏増嘉伸，吉川耕司，庄司光男：演算加速装置に基づく超並列クラスタHA-PACSによる大規模計算科学，情報処理学会研究報告，Vol. 2011-HPC-130，No. 21 (2011).
久保田悠司，高橋大介：GPUにおける格納形式自動選択による疎行列ベクトル積の高速化，情報処理学会研究報告，Vol. 2010-ARC-192，2010-HPC-128，No. 19 (2010).
高橋大介：並列FFTにおける自動チューニング，日本応用数理学会2010年度年会講演予稿集，pp. 303-304 (2010).
椋木大地，高橋大介：GPUによる4倍精度BLASの実装と評価，計算工学講演会論文集，Vol. 15，No. 2，pp. 891-894 (2010).
高橋大介：ペタスケール計算環境に向けたFFTライブラリ，計算工学講演会論文集，Vol. 15，No. 1，pp. 95-98 (2010).
椋木大地，高橋大介：GPUによる4倍精度BLASの実装と評価，情報処理学会研究報告，Vol. 2009-ARC-186，2009-HPC-123，No. 13 (2009).
多田野寛人，高橋大介，佐藤三久，吉田紀生，丸山豊，平田文男：超並列クラスタにおける3D-RISMへのVolumetric並列三次元FFTの適用と性能評価，情報処理学会研究報告，Vol. 2009-HPC-122，No. 3 (2009).
高橋大介：ペタスケール計算環境に向けたFFTライブラリ，日本応用数理学会2009年度年会講演予稿集，pp. 3-6 (2009).
高橋大介：コンピュータサイエンスとの連携，日本物理学会2009年秋季大会講演概要集，p. 26aQL-3 (2009).
久保田悠司，佐藤佳州，高橋大介：マルチコアプロセッサとSIMD演算によるモンテカルロ木探索を用いたオセロの実装，情報処理学会研究報告，Vol. 2009-GI-22，No. 7 (2009).
高橋大介：次世代スーパーコンピュータに向けた高速フーリエ変換アルゴリズム，第58回理論応用力学講演会講演論文集，pp. 55-56 (2009).
高橋大介：マルチコア超並列クラスタにおけるVolumetric並列三次元FFTの実現と評価，情報処理学会研究報告，2009-ARC-182，2009-HPC-119，pp. 19-24 (2009).
高橋大介，後藤和茂，朴泰祐，建部修見，佐藤三久，三上和徳：T2K筑波システムにおけるLinpack性能評価，情報処理学会研究報告，2008-HPC-116，pp. 55-60 (2008).
佐藤佳州，高橋大介：モンテカルロ法によるコンピュータ将棋の実現，情報処理学会第70回全国大会講演論文集，2U-4 (2008).
住元真司，大江和一，久門耕一，高橋大介，朴泰祐，佐藤三久，藏増嘉伸，吉江友照，宇川彰：PACS-CSにおける隣接通信性能の高速化，情報処理学会研究報告，2007-HPC-111，pp. 213-218 (2007).
高橋睦史，佐藤三久，高橋大介，朴泰祐，宇川彰，中村宏，青木秀貴，澤本英雄，助川直伸：演算加速機構を持つオンチップメモリプロセッサの電力性能評価，情報処理学会研究報告，2007-ARC-174，pp. 37-42 (2007).
高橋睦史，佐藤三久，高橋大介，朴泰祐，宇川彰，中村宏，青木秀貴，澤本英雄，助川直伸：オンチップメモリプロセッサでの演算加速機構の検討，情報処理学会研究報告，2007-ARC-172，2007-HPC-109，pp. 263-268 (2007).
Piotr Luszczek, David H. Bailey, Jack Dongarra, Jeremy Kepner, Robert F. Lucas, Rolf Rabenseifner, and Daisuke Takahashi: The HPC Challenge (HPCC) benchmark suite, Proc. 2006 ACM/IEEE Conference on Supercomputing (SC'06) (2006).
木村英明，佐藤三久，堀田義彦，今田貴之，朴泰祐，高橋大介：MegaProto/Eにおける電力性能評価および電力性能最適化の検討，情報処理学会研究報告，2006-HPC-108，pp. 73-78 (2006).
今田貴之，佐藤三久，堀田義彦，木村英明，朴泰祐，高橋大介，三浦信一：MegaProto/Eにおける電力性能評価および電力性能最適化の検討，情報処理学会研究報告，2006-HPC-108，pp. 67-72 (2006).
三浦信一，岡本高幸，朴泰祐，佐藤三久，高橋大介：tagged-VLANとマルチリンクに基づくPCクラスタ向け高性能・耐故障ネットワークの実装と評価，情報処理学会研究報告，2006-HPC-108，pp. 25-30 (2006).
横澤拓弥，高橋大介，朴泰祐，佐藤三久：行列積を用いた古典Gram-Schmidt直交化の再帰的実装，日本応用数理学会2006年度年会講演予稿集，pp. 324-325 (2006).
住元真司，大江和一，久門耕一，高橋大介，朴泰祐，佐藤三久，吉江友照，宇川彰：PACS-CSのための高性能通信ライブラリインターフェイスの設計，情報処理学会研究報告，2006-HPC-107，pp. 215-219 (2006).
朴泰祐，佐藤三久，高橋大介，宇川彰，深川正一，藤田不二男，清水正明，住元真司，久門耕一：科学技術計算用超並列クラスタPACS-CSの実装と基本性能評価，情報処理学会研究報告，2006-HPC-107，pp. 209-214 (2006).
堀田義彦，佐藤三久，木村英明，朴泰祐，高橋大介：PCクラスタにおける全体電力プロファイルを用いた電力性能最適化，情報処理学会研究報告，2006-ARC-169，pp. 1-6 (2006).
Takuya Yokozawa, Daisuke Takahashi, Taisuke Boku, and Mitsuhisa Sato: Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication, Proc. 4th International Workshop on Parallel Matrix Algorithms and Applications (PMAA'06), pp. 37-38 (2006).
横澤拓弥，高橋大介，朴泰祐，佐藤三久：行列積を用いた古典Gram-Schmidt直交化の並列化手法の検討，情報処理学会研究報告，2006-HPC-106，pp. 31-36 (2006).
岡本高幸，三浦信一，朴泰祐，佐藤三久，高橋大介：EthernetマルチリンクによるPCクラスタ向け耐故障ネットワークRI2N/UDP，先進的計算基盤システムシンポジウムSACSIS2006論文集, pp. 271-272 (2006).
Hideaki Kimura, Mitsuhisa Sato, Yoshihiko Hotta, Taisuke Boku, and Daisuke Takahashi: Reducing Energy of Parallel Programs using Slack Reclamation by DVFS in a Power-scalable High Performance Cluster, Proc. IEEE Symposium on Low-Power and High-Speed Chips (COOL Chips IX), p. 187 (2006).
木村英明，佐藤三久，堀田義彦，朴泰祐，高橋大介：DVS制御による負荷不均衡のある並列プログラムの電力量削減手法，情報処理学会研究報告，2006-ARC-167，2006-HPC-105，pp. 151-156 (2006).
堀田義彦，佐藤三久，木村英明，松岡聡，朴泰祐，高橋大介：PCクラスタにおける電力実行プロファイル情報を用いたDVS制御による電力性能の最適化，情報処理学会研究報告，2006-ARC-167，2006-HPC-105，pp. 139-144 (2006).
岡本高幸，三浦信一，朴泰祐，佐藤三久，高橋大介：EthernetマルチリンクによるPCクラスタ向け耐故障ネットワークRI2N/UDP，情報処理学会研究報告，2006-ARC-167，2006-HPC-105，pp. 85-90 (2006).
額田彰，高橋大介，須田礼仁，西田晃：多様な計算環境で高性能を実現するFFTライブラリFFTSS，2006年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2006論文集, p. 33 (2006).
三浦信一，岡本高幸，朴泰祐，佐藤三久，高橋大介：tagged-VLANに基づくPCクラスタ向け高バンド幅ツリーネットワークの開発，情報処理学会研究報告，2005-HPC-104，pp. 13-18 (2005).
相田祥昭，中島佳宏，佐藤三久，櫻井鉄也，高橋大介，朴泰祐：グリッドRPCシステムOmniRPCにおける初期データの分散管理による効率化，情報処理学会研究報告，2005-HPC-104，pp. 7-12 (2005).
朴泰祐，梅村雅之，佐藤三久，高橋大介，中本泰史，須佐元，森正夫：FIRST - 第一世代天体の起源解明のための専用・汎用計算機融合型クラスタ，情報処理学会研究報告，2005-HPC-103，pp. 145-150 (2005).
中島佳宏，佐藤三久，相田祥昭，朴泰祐，高橋大介，Franck Cappello：複数グリッドミドルウエア上で動作するGrid RPCシステムOmniRPCの設計と実装，情報処理学会研究報告，2005-HPC-103，pp. 73-78 (2005).
堀田義彦，佐藤三久，木村英明，朴泰祐，高橋大介，松岡聡：PCクラスタにおけるDVS制御による電力性能の最適化，情報処理学会研究報告，2005-ARC-164，pp. 49-54 (2005).
高橋睦史，佐藤三久，高橋大介，朴泰祐，中村宏，近藤正章，藤田元信：オンチップRAM利用による電力性能の最適化と評価，情報処理学会研究報告，2005-ARC-164，pp. 43-48 (2005).
Mitsuhisa Sato, Yoshihiro Nakajima, Tetsuya Sakurai, Taisuke Boku, and Daisuke Takahashi: OmniRPC Grid Parallel Programming Environment for a Large Scale Numerical Computation, Proc. 17th IMACS World Congress Scientific Computation, Applied Mathematics and Simulation (2005).
Mitsuhisa Sato, Yoshinori Ojima, Taisuke Boku, and Daisuke Takahashi: Portable Software Distributed Shared Memory SCASH-MPI for Omni OpenMP Compiler, Proc. First International Workshop on OpenMP (IWOMP 2005) (2005).
中島浩，中村宏，佐藤三久，朴泰祐，松岡聡，高橋大介，堀田義彦：高性能計算のための低電力・高密度クラスタ MegaProto，情報処理学会研究報告，2005-ARC-162，2005-HPC-101，pp. 121-126 (2005).
長谷川秀彦，須田礼仁，額田彰，梶山民人，中島研吾，高橋大介，小武守恒，藤井昭宏，西田晃：計算環境に依存しない行列計算ライブラリインタフェースSILC，情報処理学会研究報告，2004-HPC-100，pp. 37-42 (2004).
中島佳宏，佐藤三久，朴泰祐，高橋大介，Samir Djilali，Franck Cappello：P2P分散システムXtremWeb上でのGrid RPCシステムOmniRPCの設計，情報処理学会研究報告，2004-HPC-99，pp. 133-138 (2004).
櫻井鉄也，早川賢太郎，佐藤三久，高橋大介：OmniRPCによるグリッド環境での大規模固有値問題の並列解法，京都大学数理解析研究所講究録, No. 1362, pp. 151-160 (2004).
Yoshihiko Hotta, Mitsuhisa Sato, Taisuke Boku, Hiroshi Nakashima, Hiroshi Nakamura, Satoshi Matsuoka, Daisuke Takahashi, Chikafumi Takahashi, Shinichi Miura, Yoshihiro Nakajima, Masaaki Kondo, and Motonobu Fujita: MegaProto: A Prototype of Ultra Low-Power Mega-Scale System, Proc. An International Symposium on Low-Power and High-Speed Chips (COOL Chips VII), Vol. 1, p. 84 (2004).
小島好紀，佐藤三久，朴泰祐，高橋大介：MPI上のソフトウェア分散共有メモリシステム，情報処理学会研究報告，2004-HPC-98，pp. 43-48 (2004).
櫻井鉄也，早川賢太郎，佐藤三久，高橋大介：OmniRPCによるグリッド環境での大規模固有値問題の並列解法，情報処理学会研究報告，2004-ARC-157，2004-HPC-97，pp. 193-197 (2004).
堀田義彦，佐藤三久，朴泰祐，中島浩，中村宏，松岡聡，高橋大介，高橋睦史，三浦信一，中島佳宏，近藤正章，藤田元信：超低電力メガスケールシステムのプロトタイプ: MegaProto，2004年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2004論文集, pp. 79-80 (2004).
三浦信一，朴泰祐，佐藤三久，高橋大介：高バンド幅/耐故障性を持つクラスタ向け結合ネットワークRI2N，先進的計算基盤システムシンポジウムSACSIS2003論文集, pp. 187-188 (2003).
櫻井鉄也，早川賢太郎，佐藤三久，高橋大介：OmniRPCによるグリッド環境での大規模固有値問題の並列解法，2003年度日本応用数理学会年会 (2003).
小島好紀，佐藤三久，朴泰祐，高橋大介：Omni/SCASHにおけるFirst Touch page allocationの実装，情報処理学会研究報告，2003-ARC-154，pp. 145-150 (2003).
堀田義彦，佐藤三久，朴泰祐，高橋大介，高橋睦史，中村宏：低消費電力プロセッサによるクラスタの検討，情報処理学会研究報告，2003-ARC-154，pp. 91-96 (2003).
中島佳宏，佐藤三久，朴泰祐，高橋大介，後藤仁志：OmniRPCによる広域ネットワーク環境でのグリッドアプリケーションの性能評価，情報処理学会研究報告，2003-HPC-95，pp. 71-76 (2003).
三浦信一，朴泰祐，佐藤三久，高橋大介：高バンド幅/耐故障性を持つクラスタ向けネットワークRI2Nの性能評価，情報処理学会研究報告，2003-HPC-95，pp. 53-58 (2003).
小沼賢治，朴泰祐，佐藤三久，高橋大介：重力計算専用計算機GRAPE-6のリモートアクセス環境，情報処理学会研究報告，2003-HPC-94，pp. 31-36 (2003).
Yoshihiko Hotta, Mitsuhisa Sato, Taisuke Boku, Daisuke Takahashi, and Chikafumi Takahashi: Measurement and Characterization of Power Consumption of Microprocessors for Power-aware Computing, Proc. An International Symposium on Low-Power and High-Speed Chips (COOL Chips VI), Vol. 1, p. 77 (2003).
三浦信一，朴泰祐，佐藤三久，高橋大介：ユーザレベルでのマルチリンク利用による高バンド幅/耐故障性を持つクラスタ向け結合ネットワークRI2N，情報処理学会研究報告，2003-HPC-93，pp. 13-18 (2003).
大滝雄介，高橋大介，朴泰祐，佐藤三久：ヘテロなクラスタ環境におけるStrassenの行列積アルゴリズムの並列化，情報処理学会研究報告，2003-HPC-93，pp. 7-12 (2003).
佐藤三久，朴泰祐，高橋大介：OmniRPC: グリッド環境での並列プログラミングのためのGrid RPCシステム，情報処理学会研究報告，2002-HPC-92，pp. 37-42 (2002).
石川裕，高橋大介，朴泰祐，佐藤三久：ItaniumプロセッサによるSCoreクラスタ構築に関する検討，情報処理学会研究報告，2002-HPC-92，pp. 1-6 (2002).
高橋睦史，近藤正章，朴泰祐，高橋大介，中村宏，佐藤三久：HPC向けオンチップメモリプロセッサアーキテクチャSCIMAのSMP化の検討と性能評価，情報処理学会研究報告，2002-ARC-149，pp. 43-48 (2002).
小島好紀，佐藤三久，原田浩，石川裕，朴泰祐，高橋大介：Ethernetによるクラスタ上での分散共有メモリOpenMP Omni/SCASHの性能評価，情報処理学会研究報告，2002-HPC-91，pp. 119-124 (2002).
高橋大介，佐藤三久，朴泰祐：SR8000におけるOpenMPベンチマーク，情報処理学会研究報告，2002-ARC-147，2002-HPC-89，pp. 139-144 (2002).
後藤祐一，劉欣，小出雅人，高橋大介，程京徳：電子投票・アンケートはe-サービスになれるか，情報処理学会ソフトウェア工学研究会2001年度ウィンターワークショップ論文集，pp. 63-64 (2002).
高橋大介：SMPクラスタにおける並列FFTのブロックアルゴリズム，情報処理学会研究報告, 2001-HPC-87，pp. 37-42 (2001).
高橋大介：拡張split-radix FFTアルゴリズム，情報処理学会研究報告, 2000-HPC-83，pp. 25-30 (2000).
高橋大介，金田康正：積和演算に向いた8基底FFT Kernelの提案，情報処理学会研究報告, 99-HPC-76，pp. 55-60 (1999).
後保範，金田康正，高橋大介：無限級数に基づく多数桁計算の演算量削減を実現する分割有理数化法，京都大学数理解析研究所講究録, No. 1084, pp. 60-71 (1999).
高橋大介，金田康正：分散メモリ型並列計算機における円周率の高精度計算，情報処理学会研究報告, 97-HPC-67，pp. 19-24 (1997).
高橋大介，金田康正：並列計算機における二次記憶を用いた一次元FFTの実現と評価，情報処理学会研究報告, 97-ARC-123，pp. 7-12 (1997).
高橋大介，金田康正：分散メモリ型並列計算機による多倍長平方根の高速計算法，情報処理学会研究報告, 96-HPC-63，pp. 19-24 (1996).
高橋大介，金田康正：分散メモリ型並列計算機による2, 3, 5基底のFFTの実現と評価，情報処理学会研究報告, 96-HPC-62，pp. 117-122 (1996).
高橋大介，金田康正：分散メモリ型並列計算機による高速多倍長計算，情報処理学会研究報告, 96-HPC-60，pp. 31-36 (1996).
高橋大介，金田康正：円周率—高速計算法と統計性—(3)，情報処理学会第37回プログラミングシンポジウム報告書, pp. 73-84 (1996).
高橋大介，金田康正：多倍長平方根の高速計算法，情報処理学会研究報告, 95-HPC-58，pp. 51-56 (1995).
高橋大介，湯淺太一：SIMD型超並列計算機におけるリバモアループの並列化とその評価，情報処理学会第47回（平成5年後期）全国大会講演論文集（6）, pp. 75-76 (1993).
高橋大介，湯淺太一：SIMD型超並列計算機におけるFFTアルゴリズム，1993年電子情報通信学会春季大会講演論文集（6）, p. D-155 (1993).

6. 口頭発表

Daisuke Takahashi: Automatic Tuning for Parallel Number-Theoretic Transforms on GPU Clusters, SIAM Conference on Parallel Processing for Scientific Computing (PP26), Zuse Institute Berlin and Free University of Berlin, Berlin, Germany, March 6, 2026.
Daisuke Takahashi: Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Manycore Clusters, The 14th AIMS Conference, ADNEC Centre Abu Dhabi, Abu Dhabi, UAE, December 20, 2024.
Daisuke Takahashi: Implementation of Parallel Number-Theoretic Transform on GPU Clusters, SIAM Conference on Parallel Processing for Scientific Computing (PP24), Lord Baltimore Hotel, Baltimore, Maryland, USA, March 7, 2024.
Daisuke Takahashi: Multiple Integer Divisions with an Invariant Dividend, 10th International Congress on Industrial and Applied Mathematics (ICIAM 2023), Waseda University, Shinjuku-ku, Tokyo, Japan, August 21, 2023.
Daisuke Takahashi: Implementation of Parallel Number-Theoretic Transform on Manycore Clusters, SIAM Conference on Computational Science and Engineering (CSE23), RAI Congress Centre, Amsterdam, The Netherlands, February 27, 2023.
Daisuke Takahashi: Parallel Implementation of FFT in a Finite Field, SIAM Conference on Parallel Processing for Scientific Computing (PP22), Online, February 26, 2022.
Daisuke Takahashi: Automatic Tuning of Computation-Communication Overlap for Parallel 3-D FFT with 2-D Decomposition, SIAM Conference on Computational Science and Engineering (CSE21), Online, March 4, 2021.
Daisuke Takahashi: Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Intel Xeon Phi Clusters, SIAM Conference on Parallel Processing for Scientific Computing (PP20), Hyatt Regency Seattle, Seattle, Washington, USA, February 14, 2020.
Daisuke Takahashi: Implementation of Parallel 3-D Real FFT with 2-D Decomposition on Intel Xeon Phi Clusters, SIAM Conference on Computational Science and Engineering (CSE19), Spokane Convention Center, Spokane, Washington, USA, March 1, 2019.
Daisuke Takahashi: Implementation of Parallel 1-D Real FFT on Intel Xeon Phi Processors, 2018 Conference on Advanced Topics and Auto Tuning in High-Performance and Scientific Computing (2018 ATAT in HPSC), National Cheng Kung University, Tainan, Taiwan, March 27, 2018.
Ayumu Gomi and Daisuke Takahashi: A Programming Framework for Performance Tuning in Julia, SIAM Conference on Parallel Processing for Scientific Computing (PP18), Waseda University, Shinjuku-ku, Tokyo, Japan, March 7, 2018.
Daisuke Takahashi: Implementation of Parallel FFTs on Cluster of Intel Xeon Phi Processors, SIAM Conference on Parallel Processing for Scientific Computing (PP18), Waseda University, Shinjuku-ku, Tokyo, Japan, March 7, 2018.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on Cluster of Intel Xeon Phi processors, 2017 Conference on Advanced Topics and Auto Tuning in High-Performance and Scientific Computing (2017 ATAT in HPSC), National Taiwan University, Taipei, Taiwan, March 10, 2017.
Daichi Mukunoki, Toshiyuki Imamura, and Daisuke Takahashi: Implementation Techniques for High Performance BLAS Kernels on Modern GPUs, SIAM Conference on Computational Science and Engineering (CSE17), Hilton Atlanta, Atlanta, Georgia, USA, February 28, 2017.
Daisuke Takahashi: Implementation of Parallel FFTs on Knights Landing Cluster, SIAM Conference on Computational Science and Engineering (CSE17), Hilton Atlanta, Atlanta, Georgia, USA, February 28, 2017.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on Intel Xeon Phi Clusters, SIAM Conference on Parallel Processing for Scientific Computing (PP16), Université Pierre et Marie Curie, Cordeliers Campus, Paris, France, April 14, 2016.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on Intel Xeon Phi Clusters, 2016 Conference on Advanced Topics and Auto Tuning in High-Performance and Scientific Computing (2016 ATAT in HPSC), National Taiwan University, Taipei, Taiwan, February 19, 2016.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on GPU Clusters, 2015 SIAM Conference on Computational Science and Engineering (CSE15), Salt Palace Convention Center, Salt Lake City, Utah, USA, March 18, 2015.
Hiroshi Maeda and Daisuke Takahashi: Performance Evaluation of Sparse Matrix-Vector Multiplication Using GPU/MIC Cluster, 2015 SIAM Conference on Computational Science and Engineering (CSE15), Salt Palace Convention Center, Salt Lake City, Utah, USA, March 14, 2015.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on GPU Clusters, 2015 Conference on Advanced Topics and Auto Tuning in High-Performance and Scientific Computing (2015 ATAT in HPSC), National Taiwan University, Taipei, Taiwan, February 28, 2015.
Daisuke Takahashi: Implementation of Parallel FFTs on GPU Clusters, 2014 Conference on Advanced Topics and Auto Tuning in High Performance and Scientific Computing (2014 ATAT in HPSC), National Taiwan University, Taipei, Taiwan, March 14, 2014.
Daisuke Takahashi: Experience of Implementing Parallel FFTs on GPU Clusters, Special Session: Legacy HPC Application Migration 2013 (LHAM) (held in conjunction with IEEE MCSoC-13), National Institute of Informatics, Chiyoda-ku, Tokyo, Japan, September 27, 2013.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs, 2013 Conference on Advanced Topics and Auto Tuning in High Performance and Scientific Computing (2013@^2HPSC), National Taiwan University, Taipei, Taiwan, March 28, 2013.
椋木大地，高橋大介：GPUにおける3倍精度演算と4倍精度疎行列反復解法，第3回多倍長精度計算フォーラム，工学院大学，東京都新宿区，2013年3月8日．
Daichi Mukunoki and Daisuke Takahashi: Iterative Method for Sparse Linear Systems using Quadruple Precision Operations on GPUs, 2013 SIAM Conference on Computational Science and Engineering (CSE13), The Westin Boston Waterfront, Boston, Massachusetts, USA, February 28, 2013.
Daisuke Takahashi, Alex Yee, Torsten Hoefler, Camille Coti, Jeongnim Kim, and Franck Cappello: An Implementation of Parallel 3-D FFT with 1.5-D Decomposition, The seventh workshop of the INRIA-Illinois-ANL Joint Laboratory on Petascale Computing, INRIA Rennes, France, June 14, 2012.
中山空星，高橋大介：GPU上における多倍長精度浮動小数点演算の実装，第2回多倍長精度計算フォーラム，工学院大学，東京都新宿区，2011年12月10日．
Daisuke Takahashi, Alex Yee, Torsten Hoefler, Camille Coti, Jeongnim Kim, and Franck Cappello: A Scalable Parallel Algorithm for 3-D FFT, The sixth workshop of the INRIA-Illinois Joint Laboratory on Petascale Computing, National Center for Supercomputing Applications, Urbana, Illinois, USA, November 22, 2011.
椋木大地，高橋大介：GPUによる4倍精度行列計算，2011年並列／分散／協調処理に関する『鹿児島』サマー・ワークショップ（SWoPP鹿児島2011），かごしま県民交流センター，鹿児島市，2011年7月27日．
Yuji Kubota and Daisuke Takahashi: Autotuning of Sparse Matrix-Vector Multiplication by Selecting Storage Schemes on GPU, 2011 SIAM Conference on Computational Science and Engineering (CSE11), Grand Sierra Resort and Casino, Reno, Nevada, USA, March 1, 2011.
Daisuke Takahashi, Camille Coti, and Franck Cappello: Optimization of a Parallel 3-D FFT with 2-D Decomposition, The fourth workshop of the INRIA-Illinois Joint Laboratory on Petascale Computing, National Center for Supercomputing Applications, Urbana, Illinois, USA, November 23, 2010.
高橋大介：最近の円周率計算，数学系月例談話会，筑波大学自然系学系棟D棟，つくば市，2010年10月7日．
Daisuke Takahashi: Automatic Tuning for Parallel 3-D FFTs, 2010 SIAM Annual Meeting (AN10), David L. Lawrence Convention Center, Pittsburgh, Pennsylvania, USA, July 16, 2010.
Daisuke Takahashi: Automatic Tuning for Parallel 3-D FFT with 2-D Decomposition, 2010 SIAM Conference on Parallel Processing for Scientific Computing (PP10), Grand Hyatt Seattle, Seattle, Washington, USA, February 25, 2010.
Daisuke Takahashi: A Volumetric 3-D FFT on Clusters of Multi-Core Processors, Third French-Japanese PAAP Workshop, Shiran-Kaikan Hall Annex, Kyoto, Japan, April 21, 2009.
Daisuke Takahashi: A Volumetric 3-D FFT on Clusters of Multi-Core Processors, 2009 SIAM Conference on Computational Science and Engineering (CSE09), Miami Hilton Downtown, Miami, Florida, USA, March 5, 2009.
横澤拓弥，高橋大介：大規模密行列に対する古典Gram-Schmidt直交化の高速化，第2回先進スーパーコンピューティング環境研究会（ASE研究会），東京大学情報基盤センター，東京都文京区，2008年8月20日．
Daisuke Takahashi: Automatic Tuning for Parallel FFTs, Second French-Japanese PAAP Workshop, ENSEEIHT-IRIT, Toulouse, France, June 24, 2008.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs, 13th SIAM Conference on Parallel Processing for Scientific Computing (PP08), The Renaissance Atlanta Hotel Downtown, Atlanta, Georgia, USA, March 12, 2008.
Daisuke Takahashi: The FFTE Library and the HPC Challenge (HPCC) Benchmark Suite, First French-Japanese PAAP Workshop, Next-Generation Supercomputer R&D Center, RIKEN, Chiyoda-ku, Tokyo, Japan, November 2, 2007.
高橋大介：高精度数学定数の特定桁を計算するBBP型公式の高速計算法，第4回計算数学研究会，コープイン京都，京都市，2006年12月20日．
高橋大介：高速フーリエ変換の並列化ライブラリ，AISTスーパークラスタ成果報告会，秋葉原コンベンションホール，東京都千代田区，2005年4月25日．
高橋大介：高速フーリエ変換の並列アルゴリズム，未来開拓推進事業「計算科学」第3回ワークショップ「計算科学におけるアルゴリズム－部分と統合」，東京大学物性研究所，柏市，2001年12月15日．

7. 招待講演

Daisuke Takahashi: Implementation of Parallel 3-D FFT with 2-D Decomposition on GPU Clusters, International Conference on Modern Mathematical Methods and High-Performance Computing in Science & Technology (M3HPCST-2026), GL Bajaj Group of Institutions, Mathura, India, January 28, 2026.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on Cluster of Intel Xeon Phi Processors, Parallel Fast Fourier Transforms (PFFT) (held in conjunction with IEEE HiPC 2018), Radisson Blu Bengaluru Outer Ring Road, Bengaluru, India, December 17, 2018.
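Daisuke Takahashi: Fast Fourier Transform Algorithms for Exascale Computing Environments, 13th NITech-NIFS Joint Seminar, Building 2, Nagoya Institute of Technology, Nagoya, Japan, July 26, 2018. (in Japanese)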
Daisuke Takahashi: Sparse Matrix-Vector Multiplication on GPUs, International Workshop on Eigenvalue Problems: Algorithms; Software and Applications, in Petascale Computing (EPASA2015), Tsukuba International Congress Center, Tsukuba, Japan, September 14, 2015.
Daisuke Takahashi: Automatic Tuning for Parallel FFTs on Clusters of Multi-Core Processors, Special Session: Auto-Tuning for Multicore and GPU (ATMG) (held in conjunction with IEEE MCSoC-12), The University of Aizu, Aizu, Japan, September 22, 2012.
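Daisuke Takahashi: An Application of Supercomputing to Fundamental Technologies of Digital Terrestrial Broadcasting, 19th Chubu Broadcasting Technology Forum, NHK Nagoya Broadcasting Station, Nagoya, Japan, May 20, 2010. (in Japanese)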
Daisuke Takahashi: Parallel Implementation of Multiple-Precision Arithmetic and 2.576 Trillion Digits of Pi Calculation on a Massively Parallel Cluster of Multi-Core Processors, Workshop on Ultra Performance and Dependable Acceleration Systems (held in conjunction with PDCAT'09), Gakushi-kaikan, Hiroshima University, Higashi-Hiroshima, Japan, December 11, 2009.
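Daisuke Takahashi: On Speeding Up FFTs, First-Principles Study Group, Faculty of Science Building 7, The University of Tokyo, Bunkyo-ku, Tokyo, Japan, April 16, 2009. (in Japanese)
Daisuke Takahashi: Toward Massively Parallel Computation of Fast Fourier Transforms, Supercomputer Workshop 2009, Okazaki Conference Center, National Institutes of Natural Sciences, Okazaki, Japan, January 20, 2009. (in Japanese)
Daisuke Takahashi: HPC Programming for the Petascale Era, IBM Amagi HPC Seminar 2008, Amagi Homestead, Izu, Japan, December 7, 2008. (in Japanese)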
Daisuke Takahashi: Performance Evaluation of Linpack on T2K-Tsukuba System, Cray Technical Workshop Japan 2008, Hotel Laforet Tokyo, Shinagawa-ku, Tokyo, Japan, October 9, 2008.
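Daisuke Takahashi: Parallel Numerical Algorithms and Their Performance, Workshop on Large-Scale, High-Accuracy Electronic Structure Calculation Methods, Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka, Japan, July 13, 2006. (in Japanese)
Daisuke Takahashi: The Current State of Parallel FFT Algorithms, Supercomputer Users' Meeting, Computing and Communications Center, Kyushu University, Fukuoka, Japan, July 31, 2002. (in Japanese)
Daisuke Takahashi: The Current State of Fast Fourier Transforms, 7th Meeting of "Computational Science for Elucidating Global-Scale Flow Phenomena: Development of Mathematical and Physical Models and Computational Algorithms", Venture Business Laboratory, Nagoya University, Nagoya, Japan, December 21, 1999. (in Japanese)
Daisuke Takahashi: Recent Computations of Pi, Kure National College of Technology, Kure, Japan, November 28, 1997. (in Japanese)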

8. Expository Articles and Reports

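Daisuke Takahashi: Automatic Tuning for Parallel FFTs on GPU Clusters, Keisan Kogaku, Vol. 20, No. 2, pp. 7-10 (2015). (in Japanese)
Daisuke Takahashi: Programming on Clusters, IEICE Knowledge Base, Group 6, Part 5, Chapter 7, Section 2 (2011). (in Japanese)
Daisuke Takahashi: Automatic Tuning for FFTs in Massively Parallel Multi-Core Environments, Oyo Suri, Vol. 20, No. 4, pp. 7-14 (2010). (in Japanese)
Daisuke Takahashi: Breaking the World Record for Pi: The Road to 2,576,980,370,000 Digits, Joho Shori, Vol. 50, No. 12, pp. 1228-1234 (2009). (in Japanese)
Daisuke Takahashi: Report on SC|05, Keisan Kogaku, Vol. 11, No. 2, pp. 25-26 (2006). (in Japanese)
Daisuke Takahashi: Performance Evaluation of Computer Systems and Program Tuning (Part 2), Joho Shori, Vol. 42, No. 12, pp. 1226-1230 (2001). (in Japanese)
Daisuke Takahashi: Performance Evaluation of Computer Systems and Program Tuning (Part 1), Joho Shori, Vol. 42, No. 11, pp. 1092-1097 (2001). (in Japanese)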

9. Books

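Hikaru Samukawa and Daisuke Takahashi: Rational Arithmetic: Algorithms and Programming for High-Precision Numerical Computation, Morikita Publishing (2023). (in Japanese)
Toshiyuki Imamura, Takeshi Ogita, Katsuhisa Ozaki, Takahiro Katagiri, Reiji Suda, Daisuke Takahashi, Hiroyuki Takizawa, and Kengo Nakajima: Software Automatic Tuning: Code Optimization Techniques for Scientific Computing, Morikita Publishing (2021). (in Japanese)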
Daisuke Takahashi: Fast Fourier Transform Algorithms for Parallel Computers, Springer (2019).
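Takeshi Iwashita, Takahiro Katagiri, and Daisuke Takahashi: Understanding Supercomputers: From the Basics to the Latest Trends, University of Tokyo Press (2015). (in Japanese)
Hikaru Samukawa, Seiji Fujino, Toshio Nagashima, and Daisuke Takahashi: IT Text: HPC Programming, Ohmsha (2009). (in Japanese)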

10. Books (Contributed Chapters)

Daisuke Takahashi: Fast Fourier Transform in Large-Scale Systems, Masaaki Geshi (Ed.): The Art of High Performance Computing for Computational Science, Vol. 1, Springer, pp. 137-168 (2019).
Taisuke Boku, Osamu Tatebe, Daisuke Takahashi, Kazuhiro Yabana, Yuta Hirokawa, Masayuki Umemura, Toshihiro Hanawa, Kengo Nakajima, Hiroshi Nakamura, Tsuyoshi Ichimura, Kohei Fujita, Yutaka Ishikawa, Mitsuhisa Sato, Balazs Gerofi, and Masamichi Takagi: Oakforest-PACS: Advanced KNL Cluster System, Jeffrey S. Vetter (Ed.): Contemporary High Performance Computing: From Petascale toward Exascale, Vol. 3, CRC Press, pp. 401-421 (2019).
Hiroyuki Takizawa, Reiji Suda, Daisuke Takahashi, and Ryusuke Egawa: Xevolver: A User-Defined Code Transformation Approach to Streamlining Legacy Code Migration, Mitsuhisa Sato (Ed.): Advanced Software Technologies for Post-Peta Scale Computing, Springer, pp. 163-181 (2019).
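Daisuke Takahashi: Fast Fourier Transform in Large-Scale Systems, Masaaki Geshi (Ed.): HPC Technologies for Computational Science 2, Osaka University Press, pp. 102-138 (2017). (in Japanese)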
Daisuke Takahashi: Automatic Tuning for Parallel FFTs, Ken Naono, Keita Teranishi, John Cavazos, and Reiji Suda (Eds.): Software Automatic Tuning: From Concepts to State-of-the-Art Results, Springer, pp. 49-67 (2010).
Daisuke Takahashi: Implementation of Multiple-Precision Parallel Division and Square Root on Distributed-Memory Parallel Computers, Yi Pan and Laurence T. Yang (Eds.): Parallel and Distributed Scientific and Engineering Computing: Practice and Experience, Nova Science Publishers, pp. 35-49 (2004).