ENGLISH    |  

DMLA/2005年度

概要

データマイニング (data mining; DM) と機械学習 (machine learning; ML) とリンク解析/線形代数 (link analysis/linear algebra; LA) に関する勉強会.

火曜日: データマイニング・機械学習・リンク解析関連のテュートリアル/入門書の輪講
金曜日: データマイニング・機械学習・リンク解析関連の文献紹介

日時

時間: 火曜日 17:00-~, 金曜日 19:00-
場所: A707

2005年度

文献紹介, 研究の進捗発表

2/24 (金) 松本

Vojech Franc and Vaclav Hlavac

Multi-class Support Vector Machine

ICPR'02, pp.236-239, 2002.

2/17 (金) 新保

Dengyong Zhou, Bernhard Sch{\"o}lkopf and Thomas Hofmann.

Semi-supervised learning on directed graphs.

Advances in Neural Information Processing Systems 17: Proc. NIPS*2004 Conference, MIT Press, 2005.

2/10 (金) 伊藤

Xuehua Shen, Bin Tan, ChengXiang Zhai,

Context-Sensitive Information Retrieval with Implicit Feedback,

Proceedings of ACM SIGIR 2005.

2/3 (金) 原

Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms

Michael Collins

Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL held in conjunction with ACL 2002

1/27 (金) 新保

Andrew McCallum, Kedar Bellare and Fernando Pereira

A conditional random field for discriminatively-trained finite-state string edit distance

Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence, 2005

1/20 (金) 伊藤

Tamara G. Kolda, Brett W. Bader, and Joseph P. Kenny,

Higher-Order Web Link Analysis Using Multilinear Algebra,

ICDM-2005.

1/13 (金) 福岡(け)

Altun, Y., Johnson, M. and Hofmann, T.,

"Investigating Loss Functions and Optimization Methods for Discriminative Learning of Label Sequences"

EMNLP 2003.

12/16 (金) 竹原

David M. Blei, John D. Lafferty,

"Correlated Topic Models" NIPS 2005

http://www.cs.cmu.edu/~lafferty/pub/ctm.pdf

12/9 (金) 賀沢

Qi Zhao and David J. Miller,

"Mixture Modeling with Pairwise, Instance-Level Class Constraints"

Nueral Computation, vol.17, no.11, p.2482-2507 (2005)

12/2 (金) 河部

A. Globerson and N. Tishby.

Sufficient dimensionality reduction - a novel analysis method.

In ICML, pages 203--210. IEEE Service Center, July 2002.

http://citeseer.ist.psu.edu/globerson02sufficient.html

11/29 (火) 松本

Altun, Y., McAllester, D. and Belkin, M.,

Maximum Margin Semi-Supervised Learning for Structured Variable

to appear in NIPS 2005.

11/22 (火) 福岡(け)

W. Cohen and S. Sarawagi,

Exploiting dictionaries in named entity extraction: Combining semi-markov extraction processes and data integration methods

In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004.

11/15 (火) 伊藤

Koji Tsuda and William Stafford Noble

Learning kernels from biological networks by maximizing entropy

Bioinformatics (2004).

11/11(金) 新保

Wensi Xi and Benyu Zhang and Zheng Chen and Yizhou Lu and Shuicheng Yan and Wei-Ying Ma and Edward A. Fox

Link fusion: a unified link analysis framework for multi-type interrelated data objects

WWW 2004

11/8 (火) 東

Andrew Smith, Trevor Cohn, Miles Osborne

Logarithmic Opinion Pools for Conditional Random Fields

In Proceedings of 43rd Annual Meeting of Association for Computational Linguistics, 2005.

11/4(金) 中川

Dan Roth and Wen-tau Yih

A Linear Programming Formulation for Global Inference in Natural Language Tasks

Proceedings of CoNLL-2004

11/1 (火) 原

Hiroto Saigo, Jean-Philippe Vert, Nobuhisa Ueda, Tatsuya Akutsu

Protein homology detection using string alignment kernels

Bioinformatics 20(11): 1682-1689 (2004)

10/28(金) 上野

Matthew Schultz and Thorsten Joachims.

Learning a Distance Metric from Relative Comparisons.

NIPS 2003.

10/25 (火) 渡邉

Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma

2D Conditional Random Fields for Web Information Extraction

In Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, 2005.

10/21(金) 木村

S. Wang, D. Schuurmans, F. Peng and Y. Zhao

Learning Mixture Models with the Latent Maximum Entropy Principle

The 20th International Conference on Machine Learning, ICML-2003

http://www.cs.ualberta.ca/%7Eswang/icml2003.ps

10/18 (火) 近藤

Gianna M.Del Corso, Antonio Gulli and Francesco Romani

Ranking a Stream of News

In Proc. WWW 2005, Chiba.

10/7 (金) 伊藤

進捗報告

「引用解析からみた文書分類用カーネル」

07/22 (金) 持橋(ATR)

Max Welling, Richard S. Zemel, and Geoffrey E. Hinton.

"Self Supervised Boosting". NIPS 2002.

http://www.cs.toronto.edu/pub/zemel/Papers/UboostNips.pdf

07/15 (金) 新保

T. Joachims.

Learning to Align Sequences: A Maximum-Margin Approach, Technical Report, August, 2003.

http://www.cs.cornell.edu/People/tj/publications/joachims_03b.pdf

07/08 (金) 松本

Supervised Clustering with Support Vector Machines

Thomas Finley and Thorsten Joachims

ICML 2005.

07/01 (金) 伊藤

Zoltan Gyongyi. Hector Gracia-Molina. Jan Pedersen.

Combating Web Spam with TrustRank

In Proceedings of the 30th VLDB Conference, Toronto, Canada, 2004.

06/24 (金) 東

Friedman, J. H.

Greedy function approximation: A gradient boosting machine.

Annals of Statistics, 29. 2001

06/17 (金) 福岡

Andrew McCallum, Ben Wellner.

Toward Conditional Models of identity uncertainty with Application to Proper nou n Coreference.

In Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web.

http://www.cs.umass.edu/~mccallum/papers/condid-ijcaiws2003.pdf

06/10 (金) 河部

E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell,

Distance metric learning, with application to clustering with side-information.

In Advances in NIPS, Vol. 15. MIT Press.

06/03 (金) 新保

Wei Fan, Salvatore J. Stolfo, Junxin Zhang, and Philip K. Chan.

AdaCost: misclassification cost-sensitive boosting.

In Proc. 16th International Conf. on Machine Learning (ICML-1999).

05/20 (金) 浅原

D. Klein, S. D. Kamvar & C. D. Manning.

From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering, in Proc. of ICML-2002.

http://www.cs.berkeley.edu/~klein/papers/constrained_clustering-ICML_2002.pdf

05/13 (金) 松本

Rong Jin, Huan Liu,

Robust Feature Induction for Support Vector Machines.

In Proceedings of the Twenty-First International Conference on Machine Learning (ICML 2004), pp.449-456, 2004.

http://www.public.asu.edu/~huanliu/papers/icml04.pdf

04/26 (火) 原

K. Tsuda, T. Kin, and K. Asai.

Marginalized kernels for biological sequences. Bioinformatics, 18(Suppl. 1):S268-S275, 2002.

http://www.cbrc.jp/%7Etsuda/ismb02.pdf

04/15 (金) 伊藤

Thomas Hofmann.

Learning the Similarity of Documents: An Information-Geometric Approach to Document Retrieval and Categorization.

In Advances in Neural Information Processing Systems 12 (NIPS 2000), MIT Press.

http://www.cs.brown.edu/people/th/papers/Hofmann-NIPS99.pdf

輪読:

2/7  (火) 
1/31 (火) 
1/24 (火) 上野
1/17 (火) 渡辺
1/10 (火) 近藤
12/20 (火) 木村
12/13 (火) 松本

輪読: Carl D. Meyer. Matrix Analysis and Applied Linear Algebra

Chapter 5 Norms, Inner Products, and Orthogonality

06/14 (火) 新保 Sections 5-5.2

06/21 (火) 松本 Sections 5.3-5.5

06/28 (火) 上野 Section 5.6

5.8 は飛ばす.

07/5 (火) 東 Section 5.7

07/12 (火) 東 Section 5.9, 原 Section 5.10

07/19 (火) 原 Section 5.11 伊藤 Section 5.12

Chapter 6 Determinants

Chapter 7 Eigenvalues and Eigenvectors

Chapter 8 Perron-Frobenius Theory of Nonnegative Matrices

新入生向けテュートリアル

06/07 (火) 浅原

HMM, MEMM, CRF による系列解析 file2005-06-07.pdf

05/31 (火) 6 月の企画会議

05/24 (火) 新保: Lagrange 乗数法, 双対理論

D. P. Bertsekas, Nonlinear Programming, 2nd ed., Athena Scientific, Belmont, MA, USA, Chapter 5.

http://www.athenasc.com/nonlinbook.html

05/17 (火) 原: 最大エントロピーモデル

北 研二

確率的言語モデル 第6章

東京大学出版会

05/10 (火) 伊藤: Support Vector Machine

N. Cristianini, and J. Shawe-Taylor.

An Introduction of Support Vector Machines and other kernel-based learning methods.

Cambridge University Press.

Sections 6.1-6.2

04/22 (金) 持橋(ATR)

「変分ベイズ法とLDAのチュートリアル」

参考資料:岩波 計算統計I

http://www.iwanami.co.jp/.BOOKS/00/2/0068510.html

過去の勉強会

リンク集