<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Rocksoccer&#039;s Fucking World &#187; MEng Co-channel Speech Separation Project</title>
	<atom:link href="http://rocksoccer.xtreemhost.com/wordpress/archives/category/meng-co-channel-speech-separation-project/feed" rel="self" type="application/rss+xml" />
	<link>http://rocksoccer.xtreemhost.com/wordpress</link>
	<description>Fuck the fucking world</description>
	<lastBuildDate>Fri, 12 Mar 2010 00:09:51 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>20080313</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/37</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/37#comments</comments>
		<pubDate>Thu, 13 Mar 2008 14:17:52 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/37</guid>
		<description><![CDATA[

This is the estimated result. With the new real and imaginary part matching algorithm, it seems all problems I encountered before have gone.



This is just a part of the generated signal. Compared to the original individual signal, the estimated result seems to be quite accurate to some extent. 


So, next step is to move to [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/1.png"/>
<p>
This is the estimated result. With the new real and imaginary part matching algorithm, it seems all problems I encountered before have gone.<span id="more-37"></span><br />
<br />
<img src="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/2.png"/>
<p>
This is just a part of the generated signal. Compared to the original individual signal, the estimated result seems to be quite accurate to some extent. </p>
<p><a href="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/original-individual-s1.png" target="_blank"><br />
<img src="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/original-individual-s1.png" height="557" width="973" title="Click to open large image in a new window"/></a></p>
<p><strong>So, next step is to move to speech processing. <em>Maybe not too difficult, who knows.</em></strong></p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/37/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IEEE &#8211; The fast Fourier transform for experimentalists. Part VI. Chirp of a bat</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/27</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/27#comments</comments>
		<pubDate>Wed, 12 Mar 2008 00:19:18 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/27</guid>
		<description><![CDATA[IEEE Xplore# Wrapper Result


1.
The fast Fourier transform for experimentalists. Part VI. Chirp of a bat
                                         [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ieeexplore.ieee.org/search/wrapper.jsp?arnumber=1599376">IEEE Xplore# Wrapper Result</a></p>
<tr>
<td class="bodyCopyBlackLargeSpaced" valign="top" width="8%"></td>
<td class="bodyCopyBlackLargeSpaced" valign="top" width="3%"><strong>1.</strong></td>
<td class="bodyCopyBlackLargeSpaced" width="89%"><strong>The fast Fourier transform for experimentalists. Part VI. Chirp of a bat</strong></p>
<p>                                                    Donnelly, D.;</p>
<p>                                                    Computing in Science &#038; Engineering [see also IEEE Computational Science and Engineering]</p>
<p>                                                    Volume 8,<br />
Issue 2,<br />
March-April 2006<br />
Page(s):72 &#8211; 78<br />
<br /><span id="more-27"></span><strong>Abstract:</strong></p>
<p>                                                    Two assumptions underlie the Fourier transform process: stationarity and linearity. When signals deviate from these conditions, the transform outcomes are suspect. A chirp, which by definition has a frequency that varies with time, doesn&#8217;t satisfy these requirements, and its fast Fourier transform (FFT) doesn&#8217;t adequately express the changing nature of the signal&#8217;s frequency content. In this analysis of a bat chirp, I first examine how the FFT handles a chirp and then how we can use a sequence of windows that individually span only a portion of the total time-domain signal to generate a frequency versus time description of the signal. The trade-off in this kind of windowing is between dynamic response and resolution: we obtain improved dynamics if we use shorter windows, whereas we get better resolution with longer windows. This article and this series concludes with a brief look at the Hilbert-Huang transform, which isn&#8217;t constrained by the same assumptions as the FFT. This transform process consists of two independent sets of operations. The first, called empirical mode decomposition, generates a set of intrinsic mode functions (IMFs), from the data. The second step extracts phase information from each IMF and its Hilbert transform. The derivative of the phase with respect to time yields the instantaneous frequency. The net effect of these operations is to transform the time-domain data to frequency versus time data instead of the amplitude versus frequency the FFT obtains.</p>
</td>
</tr>
<tr>
<td colspan="3" valign="top"><img src="/images/transparent_spacer.gif" height="5" width="165"></td>
</tr>
<tr>
<td colspan="3" valign="top"><img src="/images/transparent_spacer.gif" height="5" width="165"></td>
</tr>
<tr>
<td colspan="2" align="right" valign="top">
                                                </td>
<td class="bodyCopyBlackLargeSpaced" valign="middle">
<p>                                                                Full Text:<br />
			                                                <a href="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/01599376.pdf" title='01599376.pdf'>PDF</a>(464 KB)</p>
<p>                                                    <span class="sectionHeaders">  IEEE JNL</span></td>
</tr>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/27/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ScienceDirect &#8211; Signal Processing : Underdetermined blind source separation using sparse representations</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/26</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/26#comments</comments>
		<pubDate>Wed, 12 Mar 2008 00:11:43 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/26</guid>
		<description><![CDATA[ScienceDirect &#8211; Signal Processing : Underdetermined blind source separation using sparse representations
Abstract
The scope of this work is the separation of N sources from M linear mixtures when the underlying system is underdetermined, that is, when M&#60;N. If the input distribution is sparse the mixing matrix can be estimated either by external optimization or by clustering [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sciencedirect.com/science?_ob=ArticleURL&#038;_udi=B6V18-444P1BV-9&#038;_user=122868&#038;_rdoc=1&#038;_fmt=&#038;_orig=search&#038;_sort=d&#038;view=c&#038;_acct=C000010083&#038;_version=1&#038;_urlVersion=0&#038;_userid=122868&#038;md5=d855e006771dc2bac447163c4c3fe9b2">ScienceDirect &#8211; Signal Processing : Underdetermined blind source separation using sparse representations</a><br />
<strong>Abstract</strong></p>
<p>The scope of this work is the separation of N sources from M linear mixtures when the underlying system is underdetermined, that is, when M&lt;N. If the input distribution is sparse the mixing matrix can be estimated either by external optimization or by clustering and, given the mixing matrix, a minimal l1 norm representation of the sources can be obtained by solving a low-dimensional linear programming problem for each of the data points. Yet, when the signals per se do not satisfy this assumption, sparsity can still be achieved by realizing the separation in a sparser transformed domain. The approach is illustrated here for M=2. In this case we estimate both the number of sources and the mixing matrix by the maxima of a potential function along the circle of unit length, and we obtain the minimal l1 norm representation of each data point by a linear combination of the pair of basis vectors that enclose it. Several experiments with music and speech signals show that their time-domain representation is not sparse enough. Yet, excellent results were obtained using their short-time Fourier transform, including the separation of up to six sources from two mixtures.</p>
<p>Full Text: <a href="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/underdetermined-blind-source-separation-using-sparse-representations.pdf">PDF</a></p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/26/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Frequently Asked Questions (FAQ) &#8211; The DSP Dimension</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/25</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/25#comments</comments>
		<pubDate>Tue, 11 Mar 2008 23:35:47 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/25</guid>
		<description><![CDATA[Frequently Asked Questions (FAQ) &#8211; The DSP Dimension
I think there might be a problem with you pitch shifter when you have two or more frequency components that fall into the same bin. For example: 1) If you have 2 freq. components that fall into separate FFT frequency bins, you are OK. Each freq. component will [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.dspdimension.com/admin/frequently-asked-questions-faq/">Frequently Asked Questions (FAQ) &#8211; The DSP Dimension</a></p>
<p><strong>I think there might be a problem with you pitch shifter when you have two or more frequency components that fall into the same bin. For example: 1) If you have 2 freq. components that fall into separate FFT frequency bins, you are OK. Each freq. component will be shifted by the same ratio. (ex. 10%) 2) If you have 2 freq. components that fall into the same FFT frequency bin, but they appear at different times, you are still OK, each freq. component will proportionally be shifted by the same ratio. 3) However, if you have 2 freq. components that fall into the same FFT frequency bin, and they happen at the same time, BOTH COMPONENTS WILL BE SHIFTED BY THE SAME AMOUNT, which means, different ratios. Is this a known issue?<br />
</strong>
<p>
<span id="more-25"></span><br />
Indeed, your observation is correct &#8211; this is related to the resolution properties of the DFT.<br />
For the case you mention, there are two different scenarios:<br />
1) Consider two closely spaced sinusoids in your input signal that fall into the same bin, for example at 500 and 505 Hz. In this case, they will be shifted both by the same amount (doubling the frequency will move them to 1000 and 1005 Hz instead of 1000 and 1010 Hz) because scaling the bin frequency just heterodynes (frequency shifts) everything within the bin bandwidth. The net effect of this shift might not be prominent enough to be noticed by the listener &#8211; depending on the DFT size (and the resulting bin frequency resolution). For small DFT sizes however this might cause the resulting sound to be inharmonic and metallic.<br />
2) Scaling two closely spaced frequencies that are initially NOT located within the same bin might move them closer together for a bin frequency scaling factor < 1.0 which might eventually cause them to fall into the same bin. In that case, the net phase effect should be calculated differently from the way it is done in the code now. I’ve not taken this into account in the present version to keep the code readable.<br />
One might also argue that closely spaced frequencies will cause a low frequency modulation (”beating”) in the signal, which will result in yet another bin getting involved in the process. Therefore, the pitch shifter introduced at the DSP Dimension web site is actually a simplified version of a much more complex process if you really want to do it “right” <img src="http://www.dspdimension.com/wp-includes/images/smilies/icon_wink.gif" alt=";-)" class="wp-smiley"></p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/25/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ScienceDirect &#8211; Thin Solid Films : Non-linear dynamic responses of a semiconductor gas sensor — Competition effect on the sensor responses to gaseous mixtures</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/24</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/24#comments</comments>
		<pubDate>Tue, 11 Mar 2008 21:07:06 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/24</guid>
		<description><![CDATA[ScienceDirect &#8211; Thin Solid Films : Non-linear dynamic responses of a semiconductor gas sensor — Competition effect on the sensor responses to gaseous mixtures
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sciencedirect.com/science?_ob=ArticleURL&#038;_udi=B6TW0-438TYRN-1N&#038;_user=122868&#038;_rdoc=1&#038;_fmt=&#038;_orig=search&#038;_sort=d&#038;view=c&#038;_acct=C000010083&#038;_version=1&#038;_urlVersion=0&#038;_userid=122868&#038;md5=89b57ab3a462b6582545259feebfd6ba">ScienceDirect &#8211; Thin Solid Films : Non-linear dynamic responses of a semiconductor gas sensor — Competition effect on the sensor responses to gaseous mixtures</a></p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/24/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ScienceDirect &#8211; Sensors and Actuators B: Chemical : Detection of a sample gas in the presence of an interferant gas based on a nonlinear dynamic response</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/23</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/23#comments</comments>
		<pubDate>Tue, 11 Mar 2008 21:03:15 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/23</guid>
		<description><![CDATA[ScienceDirect &#8211; Sensors and Actuators B: Chemical : Detection of a sample gas in the presence of an interferant gas based on a nonlinear dynamic response
Sensors and Actuators B: Chemical
   
  
Volume 56, Issues 1-2,
1 July 1999,
Pages 79-84
]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sciencedirect.com/science?_ob=ArticleURL&#038;_udi=B6THH-3X4VJTN-D&#038;_user=122868&#038;_rdoc=1&#038;_fmt=&#038;_orig=search&#038;_sort=d&#038;view=c&#038;_acct=C000010083&#038;_version=1&#038;_urlVersion=0&#038;_userid=122868&#038;md5=ab2edb91c41b41f70ddca2448b657bf1">ScienceDirect &#8211; Sensors and Actuators B: Chemical : Detection of a sample gas in the presence of an interferant gas based on a nonlinear dynamic response</a><br />
<a href="/science/journal/09254005"><strong>Sensors and Actuators B: Chemical</strong></a><br />
   </p>
<p>  <a href="/science?_ob=PublicationURL&#038;_tockey=#TOC#5283#1999#999439998#114730#FLA#&#038;_cdi=5283&#038;_pubType=J&#038;_auth=y&#038;_acct=C000010083&#038;_version=1&#038;_urlVersion=0&#038;_userid=122868&#038;md5=d2362a9a18847a31124e4075e68c72d8"></p>
<p>Volume 56, Issues 1-2</a>,</p>
<p>1 July 1999,</p>
<p>Pages 79-84</p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/23/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>20080308</title>
		<link>http://rocksoccer.xtreemhost.com/wordpress/archives/16</link>
		<comments>http://rocksoccer.xtreemhost.com/wordpress/archives/16#comments</comments>
		<pubDate>Sat, 08 Mar 2008 15:22:19 +0000</pubDate>
		<dc:creator>Rocksoccer</dc:creator>
				<category><![CDATA[MEng Co-channel Speech Separation Project]]></category>

		<guid isPermaLink="false">http://rocksoccer.xtreemhost.com/wordpress/archives/16</guid>
		<description><![CDATA[Almost the end of the project, but still not too much results are available yet.
//check hte index
with 1 hz step size
1, full, known f0, different combination
2,

Omega_row and omega_col only give the fundamental frequency of every search. They are different from the
frequency indices. The fundamental frequency of the first signal is represented with
100+omega_row*search_step_size. The fundamental frequency [...]]]></description>
			<content:encoded><![CDATA[<p>Almost the end of the project, but still not too much results are available yet.</p>
<p>//check hte index</p>
<p>with 1 hz step size</p>
<p>1, full, known f0, different combination<br />
2,<br />
<span id="more-16"></span></p>
<p>Omega_row and omega_col only give the fundamental frequency of every search. They are different from the<br />
frequency indices. The fundamental frequency of the first signal is represented with<br />
100+omega_row*search_step_size. The fundamental frequency of the second signal is represented with 100+<br />
(omega_row+omega_col+1)*search_step_size. The second index is in fact the difference between the<br />
fundamental frequency of the first signal and that of the second.</p>
<p>The problem is still how to estimate the fundamental frequency. More accurately, is it possible to evaluate the difference of amplitude of two signals in frequency domain?</p>
<p><img src="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/image001.jpg" alt="" /><br />
<img src="http://rocksoccer.hostsnake.com/wordpress/wp-content/uploads/image002.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://rocksoccer.xtreemhost.com/wordpress/archives/16/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
