Personalized entity recommendation: A heterogeneous information network approach X Yu, X Ren, Y Sun, Q Gu, B Sturt, U Khandelwal, B Norick, J Han Proceedings of the 7th ACM international conference on Web search and data …, 2014 | 934 | 2014 |
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... arXiv preprint arXiv:2404.14219, 2024 | 844 | 2024 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 656 | 2022 |
PathSelClus: Integrating meta-path selection with user-guided object clustering in heterogeneous information networks Y Sun, B Norick, J Han, X Yan, PS Yu, X Yu ACM transactions on knowledge discovery from data (TKDD) 7 (3), 1-23, 2013 | 441 | 2013 |
Recommendation in heterogeneous information networks with implicit user feedback X Yu, X Ren, Y Sun, B Sturt, U Khandelwal, Q Gu, B Norick, J Han Proceedings of the 7th ACM conference on Recommender systems, 347-350, 2013 | 244 | 2013 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large … S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, and Bryan Catanzaro, 2022 | 148 | 2022 |
Large-scale embedding learning in heterogeneous event data H Gui, J Liu, F Tao, M Jiang, B Norick, J Han 2016 IEEE 16th International Conference on Data Mining (ICDM), 907-912, 2016 | 104 | 2016 |
User guided entity similarity search using meta-path selection in heterogeneous information networks X Yu, Y Sun, B Norick, T Mao, J Han Proceedings of the 21st ACM international conference on Information and …, 2012 | 76 | 2012 |
Embedding learning with events in heterogeneous information networks H Gui, J Liu, F Tao, M Jiang, B Norick, L Kaplan, J Han IEEE transactions on knowledge and data engineering 29 (11), 2428-2441, 2017 | 59 | 2017 |
An empirical study of Mamba-based language models R Waleffe, W Byeon, D Riach, B Norick, V Korthikanti, T Dao, A Gu, ... arXiv preprint arXiv:2406.07887, 2024 | 55 | 2024 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model (2022) S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 31 | 2022 |
Phi-3 technical report: A highly capable language model locally on your phone, 2024 M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... URL https://arxiv.org/abs/2404.14219, 2024 | 30 | 2024 |
Effects of the number of developers on code quality in open source software: a case study B Norick, J Krohn, E Howard, B Welna, C Izurieta Proceedings of the 2010 ACM-IEEE International Symposium on Empirical …, 2010 | 22 | 2010 |
& Catanzaro, B. (2022). Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 22 | 2022 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model. arXiv S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 20 | 2022 |
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... A large-scale generative language model, 2022 | 17 | 2022 |
NewsNetExplorer: automatic construction and exploration of news information networks F Tao, G Brova, J Han, H Ji, C Wang, B Norick, A El-Kishky, J Liu, X Ren, ... Proceedings of the 2014 ACM SIGMOD International Conference on Management of …, 2014 | 16 | 2014 |
Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model. arXiv 2022 S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... arXiv preprint arXiv:2201.11990, 2022 | 12 | 2022 |
Active learning on heterogeneous information networks: A multi-armed bandit approach D Xin, A El-Kishky, D Liao, B Norick, J Han 2018 IEEE International Conference on Data Mining (ICDM), 1350-1355, 2018 | 9 | 2018 |
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset D Su, K Kong, Y Lin, J Jennings, B Norick, M Kliegl, M Patwary, ... arXiv preprint arXiv:2412.02595, 2024 | 1 | 2024 |