ジェスチャ併用型 Voice-to-MIDI システムの提案 第五回知識創造支援システムシンポジウム報告書 : 本著作物の著作権は著者に帰属します

Similar documents
(Osaka Industrial Technology - Platform)

IEEE. s Magazine 電子情報通信学会誌 電気学会誌第 123 巻 4 号 年 4 月. IEEE Photonics Tech. Lett.,

車載カメラにおける信号機認識および危険運転イベント検知 Traffic Light Recognition and Detection of Dangerous Driving Events from Surveillance Video of Vehicle Camera

情Propagation Characteristics of 700MHz Band V2X Wireless Communication*

科学研究費助成事業 ( 科学研究費補助金 ) 研究成果報告書

23 May 2018, Galveson, TX Science of Team Science 2018 Conference Ge WANG 1,3 and Ken-ichi SATO 2,3

A Co-worker Robot PaDY" for Automobile Assembly Line

Implementation as a Trickle-down Process of Knowledge and Technology to a Local Community

科学技術 学術審議会大型プロジェクト作業部会 2015 年 12 月 22 日 永野博

The seven pillars of Data Science

Title inside of Narrow Hole by Needle-Typ. Issue Date Journal Article. Text version author.

Title of the body. Citation. Issue Date Conference Paper. Text version author. Right

l Reef in Ishigaki Island- Author(s) a, Tanouchi, Hiroki, Nasu, Seigo

Service Research and Innovation in Japan

Study in Patent Risk and Countermeasures Related to Open Management in Interaction Design

[1] 大橋和也, 森拓哉, 古関隆章 運転整理時における乗車率に応じた旅客行動の変化のモデル化 電気学会論文誌 D,J-Rail 2013 特集,2015,pp


樊晉源簡歷 元智大學 / 工業工程與管理研究所 / 博士 (2005/06/30~2009/06/30) 大葉大學 / 事業經營研究所 / 碩士 (2001/06/30~2003/06/30) 科技政策研究與資訊中心政策研究組副研究員 (2014/01/01~ 迄今 )

Installation Manual WIND TRANSDUCER

The Current State of Digital Healthcare

研究開発評価に関する国際的な視点や国際動向

レーダー流星ヘッドエコー DB 作成グループ (murmhed at nipr.ac.jp) 本規定は レーダー流星ヘッドエコー DB 作成グループの作成した MU レーダー流星ヘッド エコーデータベース ( 以下 本データベース ) の利用方法を定めるものである

ICTを活用した英語アカデミック ライティング指導 Title : 支援ツールの開発と実践 水本, 篤, 染谷, 泰正, 山本, 敏幸, 浜谷, 佐和子, Author(s) 小山, 由紀江, 近藤, 悠介, 今尾, 康裕, 大野, 真澄, 濱地, 亮太, 名部井, 敏代, 山西, 博之

When Manga Fans Become Pirates: The Art of Translating and Navigating Japanese Manga

小川憲一 京都大学医学部附属病院放射線部 1. 方法 1-1. A System of Setting Exposure Conditions in General X-rays Using a Calculating Formula

特集 米国におけるコンシューマ向けブロードバンド衛星サービスの現状

Onboard Antenna for 700 MHz Band V2X Communication *

Fig. 1. The polarimetric UWB GB-SAR system with circular polarization spiral antenna array.

Summer School on GNSS 2015

CER7027B / CER7032B / CER7042B / CER7042BA / CER7052B CER8042B / CER8065B CER1042B / CER1065B CER1242B / CER1257B / CER1277B

Indonesian Printing Industry Trends, Current Technology, and Future Development

TED コーパスを使った プレゼンにおける効果的な 英語表現の抽出

Citation 年次学術大会講演要旨集, 23: 本著作物は研究 技術計画学会の許可のもとに掲載するものです This material is posted here w

Measuring the performance of Knowledge Transfer from Universities to Industry in China. ZHONG Wei Renmin Univ

博士学位論文. Doctoral Thesis 内容の要旨 審査結果の要旨. Thesis Abstracts and Summaries of the Thesis Review Results. The Twelfth Issue. The University of Aizu

M. Khosarvy, M.R. Asharif, K. Yamashita, AN EFFICIENT ICA BASED APPROACH Multi-Carrier System, Vol. 41, pp.47-56, 2009

Big Data and High Performance Computing

Creation of Digital Archive of Japanese Products Design process

Fiber 鄄 coupled Diode Laser Flexible Processing Source for Metal Sheet Welding

Share patents, and they shall be given you: An empirical study on consequences of patent commons

JSPS Science Dialog Program Kofu Higashi High School

Chronicle of a Disaster: Understand

Present Status of SMEs I

第 1 回先進スーパーコンピューティング環境研究会 (ASE 研究会 ) 発表資料

Lesson 5 What The Last Supper Tells Us

ITU-R WP5D 第 9 回会合報告書

What to discuss about data?

ews 市民社会におけるガバナンスの教育研究拠点 Contents 慶 應 義 塾 大 学 グ ローバル C O E プ ログラム No.6 CGCS ニューズレター 2010.July

国際会議 ACM CHI ( ) HCI で生まれた研究例 2012/10/3 人とコンピュータの相互作用 WHAT IS HCI? (Human-Computer Interaction (HCI)

THE INSTRUCTION. Tetsuya Nishio Cup The Japan Number Place Championship 2009

Challenge for Analog Circuit Testing in Mixed-Signal SOC

Page No. 原文 リライト EDITOR'S NOTES 1 4 NATURAL ART

Final Product/Process Change Notification Document # : FPCN22191XD1 Issue Date: 24 January 2019

Noise Robust Optical Sensor for Driver s Vital Signs *

ロボティクスと深層学習. Robotics and Deep Learning. Keywords: robotics, deep learning, multimodal learning, end to end learning, sequence to sequence learning.

第 4 回トポス会議イノベーティング イノベーション - 日本のイノベーション のパラダイム シフト - Date/Time: Venue: Organizer: Sponsor: Co-sponsor:

Future Perspectives of Science, Technology and Innovation

SanjigenJiten : Game System for Acquiring New Languages Visually 三次元辞典 : 第二言語学習のためのゲームシステム. Robert Howland Emily Olmstead Junichi Hoshino

Supporting Communications in Global Networks. Kevin Duh & 歐陽靖民

Effective Utilization of Patent Information in Japanese global companies

FUMIKO HAYASHI. Mayor of the City of Yokohama

修士 / 博士課程専門課題 Ⅱ 試験問題

Ⅲ. 研究成果の刊行に関する一覧表 発表者氏名論文タイトル名発表誌名巻号ページ出版年. lgo/kourogi_ pedestrian.p df. xed and Augmen ted Reality

アルゴリズムの設計と解析. 教授 : 黄潤和 (W4022) SA: 広野史明 (A4/A8)

新着資料リスト (2007 年 8 月 )/A list of new arrivals in August, 2007

How Capturing the Movement of Ions can Contribute to Brain Science and Improve Disease Diagnosis

Intermediate Conversation Material #10

Human-Robot Interaction from Dance Partner Robot to Co-worker Robot

平成 31 年度英語実技検査 ( リスニングテスト )

L1 Cultures Go Around the World

製品系列統合化設計とそのタスク構造 日本機械学会論文集 C 編. 65(629) P.416-P

相関語句 ( 定型のようになっている語句 ) の表現 1. A is to B what C is to D. A と B の関係は C と D の関係に等しい Leaves are to the plant what lungs are to the animal.

Private Equity: where should you invest today? P&I Global Pension Symposium, Tokyo

Toward a new era of EU-Japan cooperation in Robotics: rationale and objectives

Future plan of JAMSTEC Argo - Core Argo and Argo extensions -

アジアの企業家精神全 5 巻 Asian Entrepreneurship. 5 vols.

Development of XML and IP Based Distributed Ground Station System for Pico / Small Satellite

U N I T. 1. What are Maxine and Debbie talking about? They are talking about. 2. What doesn t Maxine like? She doesn t like. 3. What is a shame?

Non-uniform Selective Way Cache の動的制御による組込みプロセッサの省エネルギー化

行政院國家科學委員會專題研究計畫成果報告

Radiometric calibration for ASTER-VNIR and HISUI in AIST

Toward The Organisational Innovation Study: A Critical Study of Previous Innovation Research

venteon Ultra-short pulse oscillators

外国語作文 ( 英語 ) Foreign Language Essay (English)

D80 を使用したオペレーション GSL システム周波数特性 アンプコントローラー設定. Arc 及びLine 設定ラインアレイスピーカーを2 から7 までの傾斜角度に湾曲したアレイセクションで使用する場合 Arcモードを用います Lineモード

TVP Group 会社紹介. Trust Venture Partners Co., Ltd.

Establishing an international cooperative strategy for the conservation of Oriental White Storks in Northeast Asia

FY 2013 Briefing Session for JST Strategic Basic Research Programs (CREST, PRESTO)

学術認証フェデレーションと連携. Wiley テキストシリーズ登場. Wileyより人気の教科書タイトルが新規配信 採用校も多い教科書タイトルも多数 各分野ごとにおススメのタイトルも多数ございます ぜひご覧ください

Corporate Education for Manufacturing (Semiconductors) - Creation of a training system and technical textbook -

Omochi rabbit amigurumi pattern

HARD LOCK Technical Reports

Omni LED Bulb. Illustration( 实际安装, 설치사례, 設置事例 ) Bulb, Downlight OBB. OBB-i15W OBB-i20W OBB-i25W OBB-i30W OBB-i35W. Omni LED.

HONG KONG SAR, CHINA

Programme プログラム International Symposium REvision 2012 New Renewable Direction for Japan

the toymakers 482CBDED42BA8C67DBD555D243A5B37D The Toymakers 1 / 6

電子回路論第 6 回 Electric Circuits for Physicists

On Endings 終結について. Ted Goossen

P (o w) P (o s) s = speaker. w = word. Independence bet. phonemes and pitch. Insensitivity to phase differences. phase characteristics

Statistical Tools for Digital Forensics. Information Technologies for IPR Protection

Transcription:

JAIST Reposi https://dspace.j Title ジェスチャ併用型 Voice-to-MIDI システムの提案 Author(s) 伊藤, 直樹 ; 西本, 一志 Citation 第五回知識創造支援システムシンポジウム報告書 : 167-172 Issue Date 2008-03-14 Type Conference Paper Text version author URL Rights http://hdl.handle.net/10119/4421 本著作物の著作権は著者に帰属します 第五回知識創造支援システムシンポジウム, 主催 : 日本創造学会, 北陸先端科学技術大学院大学, 共催 : 石川県産業創出支援機構文部科学省知的クラスター創成 Description 事業金沢地域 アウェアホームのためのアウェア技術の開発研究, 開催 : 平成 20 年 2 月 21 日 ~23 日, 報告書発行 : 平成 20 年 3 月 14 日 Japan Advanced Institute of Science and

1 Voice-to-MIDI A Voice-to-MIDI pitch input method with concurrently using tap gestures Naoki Itou Kazushi Nishimoto School of Knowledge Science, Japan Advanced Institute of Science and Technology n-itou@jaist.ac.jp, http://www.jaist.ac.jp/~n-itou/ Center for Knowledge Science, Japan Advanced Institute of Science and Technology knishi@jaist.ac.jp, http://www.jaist.ac.jp/~knishi/ keywords: Voice-to-MIDI, Gesture, Tapping, Rhythm segmentation, Pitch correction Summary Voice-to-MIDI, one of the input methods for MIDI sequence data, has a merit that users can input melodies intuitively. However, sometimes the quolity of pitch translation is not satisfactory. To solve this issue, we propose a method to correct such translation mistakes by concurrently using rhythm taps and gestures with the Voice-to-MIDI. Our method allows the users to input 3 level high / low / hold pitch transition information by tap gesture when they start to sing and tap. After singing and tapping, the note peers have the paradox between pitch transition by pitch translation algorithm and pitch transition by tap gesture are corrected by the pitch correction rules. We developped the prototype system and had the experiments to evaluate the pitch correction accuracy and its usability with 2 subjects. According to an example of the results, For total 4 paradoxical note peers, 1 peer was corrected propery, but others are corrected with mistakes. For our system, the subjects said it is heavy work that they should sing, tap and imagine the pitch transition concurrently. Our method shows some usable case, but we found the issues of our method and correction rules. 1. MIDI(Musical Instruments Digital Interface 1 MIDI MIDI PC 1 1 http://www.amei.or.jp/ Voice-to-MIDI [YAMAHA 03, INTERNET 06] Voice-to-MIDI Voice-to-MIDI 1 Voice-to- MIDI [ 06] Voice-to-MIDI Voice-to-MIDI

2 2 Voice-to-MIDI 2. 2 1 Voice-to-MIDI Voice-to-MIDI 2 1 2 [YAMAHA 03] [ 07] Voice-to-MIDI QBH(Query-by-Humming) [Lutz 01 Alexandra 99 Sonoda 98] [ 02] 2 Voice-to-MIDI 2 Voice-to-MIDI 1 2 2 Voice-to-MIDI Voice-to-MIDI Voice-to-MIDI 1 Voice-to-MIDI 1 PC 1 Voice-to-MIDI Y Y 3 (MIDI note on note off ) 1 2

Voice-to-MIDI 3 2 (1) (2) 0 (1) 2 (2) 2 (3) (4) [ 02] 2 3 1 1 1 Voice-to-MIDI Microsoft Visual C 2005, Visual C++2005 DLL DirectSound ( ) E2-G5 A4 = 440Hz MIDI 22.05kHz, 16bit, Y Y 20pixel 40pixel 1cm 40pixel note on/off Wave EXCEL 2 (STFT)

4 1 Condition 2 FFT( = 2048samples : 100ms) E2-G5 FFT [ 83] cent STFT 256samples 12ms 3. 3 1 1 5 Condition 1 Condition 1, 3, 4 Condition Voiceto-MIDI Condition 5 YAMAHA XGWorks ST[YAMAHA 03] 7 (6 ) 6 2 31 2 2 : 2 : 6 2 B hp 2710p PC 3 3

Voice-to-MIDI 5 3 A MIDI MIDI 1 3 Condition 3 5 Condition Condition Condition 1 3 1 1 Wave MIDI XG WorksST MIDI BPM=60 BPM=60 BPM=110 Condition 3 2 B ( Voice-to-MIDI ) A 3 MIDI 30 10 4 2 16 17 17 16 3 A A 2 3 D 3 D 3 D 3 E3 2 A 4 30 31 3 16 17 15 16 4 15 G3 16 17 2 A 4 2 14 15 15 G3 14 F 3 F 3 G3 2 13 14 2 6 A 3

6 3 A 3 3 4 B A Condition 3 3 Condition 1 Voice-to-MIDI Condition 2 A Condition 3 Condition 4 Condition 1 Condition 4 Condition 5 Voice-to-MIDI ( ) B Condition 4 4 Condition 3 Condition 4 Condition 2 A B Condition 1 B Condition 4 Condition 1 3 4 Wave Condition 5 A 4. Voiceto-MIDI Voice-to-MIDI [YAMAHA 03] XGworks ST http://www. yamaha.co.jp/product/syndtm/p/cmp/xgwstw/index.html. [INTERNET 06] SingerSongWriter Lite5 http://www.ssw.co.jp/products/ssw/win/sswlt50w /index.html. [ 06] MIDI 2step 2006-EC-5, Vol.2006, pp.43-48 2006. [ 07] MA2007-73 Vol.26, No.6, pp.99-104 2007. [Lutz 01] Lutz P Rainer T An Interface for melody input ACM Trans. on Computer-Human Interaction TOCHI Vol.8 No.2 pp133-149 2001 [Alexandra 99] Alexandra U Justin Z Melodic matching techniques for large music databases Proc. of the seventh ACM int. conf. on Multimedia MULTIMEDIA 99 pp57-66 1999 [ 98] Tomonari Sonoda Masataka Goto Yoichi Muraoka A WWW-based Melody Retrieval System ICMC 98 Proc. pp349-352 1998 [ 02] Vol.43, No.2, pp.287-298 2002. [ 83] pp718-723 1983