概要
データマイニング (data mining; DM) と機械学習 (machine learning; ML) とリンク解析/線形代数 (link analysis/linear algebra; LA) に関する勉強会.
火曜日: データマイニング・機械学習・リンク解析関連のテュートリアル/入門書の輪講
金曜日: データマイニング・機械学習・リンク解析関連の文献紹介
日時
時間: 火曜日 17:00-~, 金曜日 19:00-
場所: A707
2005年度
文献紹介, 研究の進捗発表
2/24 (金) 松本
Vojech Franc and Vaclav Hlavac
Multi-class Support Vector Machine
ICPR'02, pp.236-239, 2002.
2/17 (金) 新保
Dengyong Zhou, Bernhard Sch{\"o}lkopf and Thomas Hofmann.
Semi-supervised learning on directed graphs.
Advances in Neural Information Processing Systems 17: Proc. NIPS*2004 Conference, MIT Press, 2005.
2/10 (金) 伊藤
Xuehua Shen, Bin Tan, ChengXiang Zhai,
Context-Sensitive Information Retrieval with Implicit Feedback,
Proceedings of ACM SIGIR 2005.
2/3 (金) 原
Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms
Michael Collins
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL held in conjunction with ACL 2002
1/27 (金) 新保
Andrew McCallum, Kedar Bellare and Fernando Pereira
A conditional random field for discriminatively-trained finite-state string edit distance
Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence, 2005
1/20 (金) 伊藤
Tamara G. Kolda, Brett W. Bader, and Joseph P. Kenny,
Higher-Order Web Link Analysis Using Multilinear Algebra,
ICDM-2005.
1/13 (金) 福岡(け)
Altun, Y., Johnson, M. and Hofmann, T.,
"Investigating Loss Functions and Optimization Methods for Discriminative Learning of Label Sequences"
EMNLP 2003.
12/16 (金) 竹原
David M. Blei, John D. Lafferty,
"Correlated Topic Models" NIPS 2005
12/9 (金) 賀沢
Qi Zhao and David J. Miller,
"Mixture Modeling with Pairwise, Instance-Level Class Constraints"
Nueral Computation, vol.17, no.11, p.2482-2507 (2005)
12/2 (金) 河部
A. Globerson and N. Tishby.
Sufficient dimensionality reduction - a novel analysis method.
In ICML, pages 203--210. IEEE Service Center, July 2002.
11/29 (火) 松本
Altun, Y., McAllester, D. and Belkin, M.,
Maximum Margin Semi-Supervised Learning for Structured Variable
to appear in NIPS 2005.
11/22 (火) 福岡(け)
W. Cohen and S. Sarawagi,
Exploiting dictionaries in named entity extraction: Combining semi-markov extraction processes and data integration methods
In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004.
11/15 (火) 伊藤
Koji Tsuda and William Stafford Noble
Learning kernels from biological networks by maximizing entropy
Bioinformatics (2004).
11/11(金) 新保
Wensi Xi and Benyu Zhang and Zheng Chen and Yizhou Lu and Shuicheng Yan and Wei-Ying Ma and Edward A. Fox
Link fusion: a unified link analysis framework for multi-type interrelated data objects
WWW 2004
11/8 (火) 東
Andrew Smith, Trevor Cohn, Miles Osborne
Logarithmic Opinion Pools for Conditional Random Fields
In Proceedings of 43rd Annual Meeting of Association for Computational Linguistics, 2005.
11/4(金) 中川
Dan Roth and Wen-tau Yih
A Linear Programming Formulation for Global Inference in Natural Language Tasks
Proceedings of CoNLL-2004
11/1 (火) 原
Hiroto Saigo, Jean-Philippe Vert, Nobuhisa Ueda, Tatsuya Akutsu
Protein homology detection using string alignment kernels
Bioinformatics 20(11): 1682-1689 (2004)
10/28(金) 上野
Matthew Schultz and Thorsten Joachims.
Learning a Distance Metric from Relative Comparisons.
NIPS 2003.
10/25 (火) 渡邉
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma
2D Conditional Random Fields for Web Information Extraction
In Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, 2005.
10/21(金) 木村
S. Wang, D. Schuurmans, F. Peng and Y. Zhao
Learning Mixture Models with the Latent Maximum Entropy Principle
The 20th International Conference on Machine Learning, ICML-2003
10/18 (火) 近藤
Gianna M.Del Corso, Antonio Gulli and Francesco Romani
Ranking a Stream of News
In Proc. WWW 2005, Chiba.
10/7 (金) 伊藤
進捗報告
「引用解析からみた文書分類用カーネル」
07/22 (金) 持橋(ATR)
Max Welling, Richard S. Zemel, and Geoffrey E. Hinton.
"Self Supervised Boosting". NIPS 2002.
07/15 (金) 新保
T. Joachims.
Learning to Align Sequences: A Maximum-Margin Approach, Technical Report, August, 2003.
http://www.cs.cornell.edu/People/tj/publications/joachims_03b.pdf
07/08 (金) 松本
Supervised Clustering with Support Vector Machines
Thomas Finley and Thorsten Joachims
ICML 2005.
07/01 (金) 伊藤
Zoltan Gyongyi. Hector Gracia-Molina. Jan Pedersen.
Combating Web Spam with TrustRank
In Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004.
06/24 (金) 東
Friedman, J. H.
Greedy function approximation: A gradient boosting machine.
Annals of Statistics, 29. 2001
06/17 (金) 福岡
Andrew McCallum, Ben Wellner.
Toward Conditional Models of identity uncertainty with Application to Proper nou n Coreference.
In Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web.
http://www.cs.umass.edu/~mccallum/papers/condid-ijcaiws2003.pdf
06/10 (金) 河部
E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell,
Distance metric learning, with application to clustering with side-information.
In Advances in NIPS, Vol. 15. MIT Press.
06/03 (金) 新保
Wei Fan, Salvatore J. Stolfo, Junxin Zhang, and Philip K. Chan.
AdaCost: misclassification cost-sensitive boosting.
In Proc. 16th International Conf. on Machine Learning (ICML-1999).
05/20 (金) 浅原
D. Klein, S. D. Kamvar & C. D. Manning.
From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering, in Proc. of ICML-2002.
http://www.cs.berkeley.edu/~klein/papers/constrained_clustering-ICML_2002.pdf
05/13 (金) 松本
Rong Jin, Huan Liu,
Robust Feature Induction for Support Vector Machines.
In Proceedings of the Twenty-First International Conference on Machine Learning (ICML 2004), pp.449-456, 2004.
04/26 (火) 原
K. Tsuda, T. Kin, and K. Asai.
Marginalized kernels for biological sequences. Bioinformatics, 18(Suppl. 1):S268-S275, 2002.
04/15 (金) 伊藤
Thomas Hofmann.
Learning the Similarity of Documents: An Information-Geometric Approach to Document Retrieval and Categorization.
In Advances in Neural Information Processing Systems 12 (NIPS 2000), MIT Press.
輪読:
2/7 (火) 1/31 (火) 1/24 (火) 上野 1/17 (火) 渡辺 1/10 (火) 近藤 12/20 (火) 木村 12/13 (火) 松本
輪読: Carl D. Meyer. Matrix Analysis and Applied Linear Algebra
Chapter 5 Norms, Inner Products, and Orthogonality
06/14 (火) 新保 Sections 5-5.2
06/21 (火) 松本 Sections 5.3-5.5
06/28 (火) 上野 Section 5.6
5.8 は飛ばす.
07/5 (火) 東 Section 5.7
07/12 (火) 東 Section 5.9, 原 Section 5.10
07/19 (火) 原 Section 5.11 伊藤 Section 5.12
Chapter 6 Determinants
Chapter 7 Eigenvalues and Eigenvectors
Chapter 8 Perron-Frobenius Theory of Nonnegative Matrices
新入生向けテュートリアル
06/07 (火) 浅原
HMM, MEMM, CRF による系列解析 2005-06-07.pdf
05/31 (火) 6 月の企画会議
05/24 (火) 新保: Lagrange 乗数法, 双対理論
D. P. Bertsekas, Nonlinear Programming, 2nd ed., Athena Scientific, Belmont, MA, USA, Chapter 5.
05/17 (火) 原: 最大エントロピーモデル
北 研二
確率的言語モデル 第6章
東京大学出版会
05/10 (火) 伊藤: Support Vector Machine
N. Cristianini, and J. Shawe-Taylor.
An Introduction of Support Vector Machines and other kernel-based learning methods.
Cambridge University Press.
Sections 6.1-6.2
04/22 (金) 持橋(ATR)
「変分ベイズ法とLDAのチュートリアル」
参考資料:岩波 計算統計I
過去の勉強会
- DMLA
- DataMining勉強会
- SVM勉強会