figure 6(a). conceptual diagram illustrating vector quantization … · 2016-11-07 · gmm based...

25
101 Proceedings of National Conference On Web and Knowledge Based Systems (WKBS) objects of interest are generically called patterns and in our case are sequences of acoustic vectors that are extracted from an input speech using the techniques described in the previous section. The classes here refer to individual speakers. Since the classification procedure in our case is applied on extracted features, it can be also referred to as feature matching. Figure 6(a) shows a conceptual diagram to illustrate recognition process. In the figure, only four speakers and two dimensions of the acoustic space are shown, illustrating vector quantization codebook formation. One speaker can be discriminated from another based of the location of centroids.. Figure 6(a). Conceptual diagram illustrating vector quantization codebook formation. One speaker can be discriminated from another based of the location of centroids. In the training phase, using the clustering algorithm described in the next Section, a speaker-specific VQ codebook (Table 3) is generated for each known speaker by clustering his/her training acoustic vectors. In the recognition phase, an input utterance of an unknown voice is “vector-quantized” using each trained codebook and the total VQ distortion is computed. The speaker corresponding to the VQ codebook with smallest total distortion is identified as the speaker of the input utterance. Sript to generate 12 MFCC coefficients and compute 8 VQ codeword. This generates 12 X 206 MFCC vector and 12 X 8 VQ codewords. Figure 6(b) is the plot of MFCC vector and 8 codeword of the speech signal. c1 = mfcc(s1, fs1); d1 = vqlbg(c1,8) plot(c1(5, :), c1(6, :), ‘xr’) hold on plot(d1(5, :), d1(6, :), ‘vk’) legend(‘Speaker 1’, ‘Codebook 1’)

Upload: others

Post on 18-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

101Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

objects of interest are generically called patterns and in our case are sequences of acousticvectors that are extracted from an input speech using the techniques described in theprevious section. The classes here refer to individual speakers. Since the classificationprocedure in our case is applied on extracted features, it can be also referred to asfeature matching.

Figure 6(a) shows a conceptual diagram to illustrate recognition process. Inthe figure, only four speakers and two dimensions of the acoustic space are shown,illustrating vector quantization codebook formation. One speaker can be discriminatedfrom another based of the location of centroids..

Figure 6(a). Conceptual diagram illustrating vector quantization codebook formation.One speaker can be discriminated from another based of the location of centroids.

In the training phase, using the clustering algorithm described in the next Section,a speaker-specific VQ codebook (Table 3) is generated for each known speaker by clusteringhis/her training acoustic vectors. In the recognition phase, an input utterance of anunknown voice is “vector-quantized” using each trained codebook and the total VQdistortion is computed. The speaker corresponding to the VQ codebook with smallesttotal distortion is identified as the speaker of the input utterance.

Sript to generate 12 MFCC coefficients and compute 8 VQ codeword. This generates12 X 206 MFCC vector and 12 X 8 VQ codewords. Figure 6(b) is the plot of MFCC vectorand 8 codeword of the speech signal.

c1 = mfcc(s1, fs1);d1 = vqlbg(c1,8)plot(c1(5, :), c1(6, :), ‘xr’)hold onplot(d1(5, :), d1(6, :), ‘vk’)legend(‘Speaker 1’, ‘Codebook 1’)

Page 2: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

102Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

-32.8 -9.34 -24.2 7.24 -29.8 -1.72 -18.3 2.04

-1.28 -0.5 -4.76 -1.42 -2.08 1.33 -1.93 0.43

0.78 4.72 0.57 -3.63 0.01 2.87 -0.48 -0

0.53 0.61 2.31 2.52 0.84 0.47 1.57 1.29

0.07 -3.12 -2.02 -3.01 -0.5 -4.88 -1.83 -4.5

0.1 1.93 0.46 0.18 0.19 0.74 0.15 0.38

-0.1 -1.45 -0.12 0.35 0.14 -1.54 -0.85 -0.5

-0.04 0.1 -0.24 -0.07 -0.18 0.08 0.14 -0.6

-0.09 -0.52 0.22 -1.28 -0.09 -0.76 -0.15 -0.6

-0.03 0.15 0.03 0.03 -0.02 0.66 -0.19 0.43

-0.03 -0.02 0.02 -0.18 0.09 0.25 0.21 0.39

0.18 -0.06 0.16 -0.01 0.2 0.19 -0.09 0.21

Table 3: Codebook of S1 (12 X 8) codewords

Figure 6(b). Graph illustrating vector quantizationcodebook formation of one speaker.

5.6 Clustering the Training Vectors

After the enrolment session, the acoustic vectors extracted from input speech ofeach speaker provide a set of training vectors for that speaker. The next important step isto build a speaker-specific VQ codebook for each speaker using those training vectors.There is a well-known algorithm, namely LBG algorithm [Linde, Buzo and Gray, 1980], forclustering a set of L training vectors into a set of M codebook vectors (figure 7).

Page 3: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

103Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

Figure 7. Flow diagram of the LBG algorithm

6. Result of the proposed system

In this paper we experiment with the building and testing of an SpeakerIdentification irrespective of emotion. In order to build such a system, we have followedthe steps that were described in previous sections using Matlab environment.

Following confusion matrix is generated after testing each speech file in the testdatabase with the trained vectors generated for computer speaker identification system.

Table 4: Confusion matrix created on the basis of Computer Emotion based SpeakerIdentification System..

Page 4: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

104Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

The following can be computed from this confusion matrix:

• The model made 46 correct predictions (8+10+3+2+3+13+7) .• The model made 12 incorrect predictions (1+2+1+1+1+1+4+1).• The model scored 58 cases (46+12).• The error rate is 12/58 =20.69%.• The accuracy rate is 46/58= 79.31%.

7. Conclusion

In this paper a well known technique of MFCC and VQ is used for speakeridentification irrespective of emotions and applied it to text independent speakeridentification system.

The result shows that with proposed method we are able to achieve comparableaccuracies of 79.31% of speaker’s identification by the system as compared to 97.25% ofspeaker’s identification by human. The results of our experiments are limited to recognizethe speaker based on the devices used for recording the corresponding speech file.

8. REFERENCES

[1] Jian Zhou, Guoyin Wang, Yong Yang, Peijun Chen., Speech Emotion RecognitionBased on Rough Set and SVM, Proc. 5th IEEE Int. Conf. on Cognitive Informatics(ICCI’06), @)2006.

[2] Liqin Fu Xia Mao Lijiang Chen., Relative Speech Emotion Recognition Based ArtificialNeural Network, 2008 IEEE.

[3] Kevin M. Indrebo, Richard J. Povinelli, and Michael T. Johnson., THIRD-ORDERMOMENTS OF FILTERED SPEECH SIGNALS FOR ROBUST SPEECH RECOGNITION.

[4] E. Darren Ellis., Design of a Speaker Recognition Code using MATLAB, 2001.

[5] Wang Yu., Research and Implementation of Emotional feature Classification andRecognition in Speech Signal, 2008 IEEE.

[6] Jaros³aw Cichosz, Krzysztof Œlot., Emotion recognition in speech signalusing emotion extracting binary decision trees.

Page 5: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

105Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

[7] Zhongzhe Xiao, Emmanuel Dellandrea, Weibei Dou and Liming Chen., FeaturesExtraction and Selection for Emotional Speech Classification.

[8] Sheeraz Memon, Margaret Lech, Namunu Maddage and Ling He., Application ofthe Vector Quantization Methods and the Fused MFCC-IMFCC Features in theGMM based Speaker Recognition.

[9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition System Basedon MFCC and VQ Algorithms, World Academy of Science, Engineering andTechnology 57 2009.

[10] Tetsuo Kosaka, Tatsuya Akatsu, Masaharu Kato and Masaki Kohda., Speaker Vector-Based Speaker Recognition with Phonetic Modeling., Speech Recognition,Technologies and Applications, Book edited by: France Miheliè andJanez •ibert, ISBN 978-953-7619-29-9, pp. 550, November2008, I-Tech, Vienna, Austria.

[11] Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md. Saifur Rahman.,SPEAKER IDENTIFICATION USING MEL FREQUENCY CEPSTRAL COEFFICIENTS, 3rdInternational Conference on Electrical & Computer Engineering ICECE 2004, 28-30 December 2004, Dhaka, Bangladesh.

[12] Bojan Kotnik, Damjan Vlaj, Zdravko Kaèiè, Bogomir Horvat.,ROBUST MFCC FEATURE EXTRACTION ALGORITHM USING EFFICIENT ADDITIVEAND CONVOLUTIONAL NOISE REDUCTION PROCEDURES, ICSLP’02 Proceedings,Denver, Colorado, USA, pp. 445-448, 2002.

[13] Benjamin J. Shannon and Kuldip K. Paliwal., MFCC Computation from MagnitudeSpectrum of Higher Lag Autocorrelation Coefficients for Robust Speech Recognition.

[14] Mohamad Adnan Al-Alaoui, Lina Al-Kanj, Jimmy Azar, and Elias Yaacoub., SpeechRecognition using Artificial Neural Networks and Hidden Markov Models, IEEEMULTIDISCIPLINARY ENGINEERING EDUCATION MAGAZINE, VOL. 3, NO. 3,SEPTEMBER 2008.

[15] Christian Cornaz Urs Hunkeler., An automatic speaker recognition system, ChristianCornaz, Urs Hunkeler 03.02.2003.

Page 6: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

106Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

[16] Patricia Melin, Jerica Urias, Daniel Solano, Miguel Soto, Miguel Lopez, and OscarCastillo., Voice Recognition with Neural Networks, Type-2 Fuzzy Logic and GeneticAlgorithms., Patricia Melin, Jerica Urias, Daniel Solano, Miguel Soto, Miguel Lopez,and Oscar Castillo

[17] A.Revathi1, R.Ganapathy2 and Y.Venkataramani ., Text Independent SpeakerRecognition and Speaker Independent Speech Recognition Using IterativeClustering Approach.,International Journal of Computer science & InformationTechnology (IJCSIT), Vol 1, No 2, November 2009.

[18] Leena Mary, B. Yegnanarayana., Extraction and representation of prosodic featuresfor language and speaker recognition

[19] Kishore Prahallad, Speech Technology: A Practical Introduction, Spectrogram,Cepstrum and Mel-Frequency Analysis., Carnegie Mellon University & InternationalInstitute of Information Technology Hyderaba, Speech Communication 50 (2008)782–796.

Page 7: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

Nano Sensors Network: The Way to Analyzethe Area Around us

Prashant AhlawatAssistant Professor , ITM University, Gurgaon

Email ID: [email protected] Kumar Sharma

Assistant Professor, IT Department, ITM University, GurgaonEmail ID: [email protected]

ABSTRACT

The unique properties of nanoscale materials will enable the development of anew generation of environmental sensing systems. Examples include: (1) distributedarrays of smart sensor networks that could be used to quantify the spatial, chemical,and biological dynamics of an eco-system in real time; (2) small, inexpensive, low-power, multifunctional sensor arrays, distributed in public places, homes, or onindividuals, that could be used to warn of pollutants and other environmental hazards;and (3) improved characterization of the life cycle of natural and anthropogenicnanoparticles in the environment. Measurement science and technology will enablethe development of a comprehensive understanding of the interaction and fate of naturaland anthropogenic nanoscale and nanostructured materials in the environment.Examples of uses of these measurements include identifying conditions for safemanufacturing, use, and disposal of nanotubes, nanoparticles, and other nanoscalematerials; and understanding how the morphology, size, composition, surface reactivity,and transport of combustion-generated particles affect human health and global climatechange.

Keywords: Nano Sensor, Informatics, Sensor, Nanotechnology

Page 8: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

108Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

1. Vision

The unique properties of nanoscale materials will enable the development of anew generation of environmental sensing systems. Examples include: (1) distributed arraysof smart sensor networks that could be used to quantify the spatial, chemical, and biologicaldynamics of an eco-system in real time; (2) small, inexpensive, low-power, multifunctionalsensor arrays, distributed in public places, homes, or on individuals, that could be usedto warn of pollutants and other environmental hazards; and (3) improved characterizationof the life cycle of natural and anthropogenic nanoparticles in the environment.Measurement science and technology will enable the development of a comprehensiveunderstanding of the interaction and fate of natural and anthropogenic nanoscale andnanostructured materials in the environment. Examples of uses of these measurementsinclude identifying conditions for safe manufacturing, use, and disposal of nanotubes,nanoparticles, and other nanoscale materials; and understanding how the morphology,size, composition, surface reactivity, and transport of combustion-generated particlesaffect human health and global climate change.

2. Current Scientific and Technological Advancements

Recent findings indicate that nanoscale dimensions play a critical role indetermining particle transport in human tissue, and therefore, are key to understandinghealth effects. Significant gains have been made by coupling studies of environmentalinteractions to nanostructured material research and device design. The last few yearshave witnessed dramatic improvements in the fabrication of nanoscale materials of interestfor environmental analysis. The use of electric fields and flowing fluids to align carbonnanotubes during and/or after growth has made it possible to produce nanoscale sensors,which have been noncovalently linked to biomolecules and used as nanoscale sensorelements.

1-10 Electrochemically fabricated metallic nanowires has been another major

advance. By changing the composition of the electrolytic solution during deposition, it ispossible to fabricate nano-wires that have complex structures akin to bar codes. The sizeand shape dependent properties of nanostructured materials give rise to new electricaland optical properties that have been incorporated into new chemical and biologicalsensors. For example, individual nanoparticles fabricated from semiconducting materialshave been used as new types of fluorescent tags that are far superior to molecularfluorophors.

12Collective effects of nanostructured materials are being used as the basis

for new types of optical and electrical sensing (e.g., the “artificial nose”). Direct electricalsensing of chemical and biological molecules using individual nanowires and nanotubeshas been demonstrated.

New methods for controlling and guiding the growth and assembly

of nanoscale materials and monolayer films also have been developed.

Page 9: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

109Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

3. Goals for the Next 10–15 Years: Barriers and Solutions

The biotechnology revolution has led to rapid advances in the detection ofbiomolecules. Nanotechnology provides a way to strongly leverage these advances towardthe development of environmental sensing systems that can operate on a continuous mode.The important role of microbes in controlling a large number of environmental processes hasbeen known for many years; however, only recently has it been possible to use genomicanalysis to understand biological diversity in environmental systems.

To fully understand the

relationships of biological systems in the environment, it will be necessary to identifybiological species through genetic analysis and assess what proteins are being expressed.This will require the ability to conduct biological analysis at extremely low concentrations,below the detection limits of existing biological sensors. Moreover, to fully understandenvironmental systems, it will be necessary to measure large numbers of chemical andbiological species simultaneously and correlate such measurements over many length scales,from sub-micron to hundreds of kilometers.

The events surrounding the Bacillus anthracis (Anthrax) releases of 2001 highlightedthe fact that good sensor technologies do not yet exist for rapid (ideally, real-time) analysisof microbes and their constituent biomolecules. Important underlying technical issues include:(1) existing biological sensor technologies are not sufficiently stable to allow usage on an insitu, continuous basis; (2) most chemical sensors use methods that are optimized for detectionof single species (array methods are fairly well developed for DNA, but the use of RNA arraysand antibody arrays is only now becoming widespread and there is not a general array fordetection of a wide variety of potential analytes); and (3)sensitivity remains an issue, becausemany molecules of interest are present at extremely low concentrations.

From the standpoint of environmental measurement, problems exist in measuringanthropogenic and natural nanoparticles that are present in the soil, air, and water. Particlesin liquid phases present unique measurement challenges. Little is known about the diversityof chemical composition at the nanoparticle level and the transformations that occur. Humanhealth, epidemiological and toxicological studies of the impact of combustion-generatednanoscale particles in ambient air on human cardiovascular health are inconclusive whenhealth effects (health endpoints) or physiological response are correlated with the bulkchemical composition of particles. Therefore, measurement techniques are needed thatdistinguish the chemical composition of particle surface layers from the bulk particle interior.A major challenge in using nanotechnology for environmental analysis is the need for genericnanoscale assembly methods. The usefulness of rapid advances in the ability to fabricate andfunctionalize individual nanoparticles and nanotubes will be realized when these materialsare integrated into more complex assemblies (i.e., dense arrays) and combined with

Page 10: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

110Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

macroscopic materials (i.e., silicon-based microelectronics) in a cost-effective manner.Nanoscale materials such as carbon nanotubes can be fabricated on silicon, but in situ chemical/biochemical modification of these structures in an array format has not yet been demonstrated.Refinement of methods to measure density, shape, and surface area also are needed.Measurement tools for nanoparticles in liquids are even less developed than for those in thegas phase.

4. Opportunities

In biology, microsensors have provided a wealth of new information about biochemicaland neurological processes, but typically only under a very strict set of experimentalconditions. To understand the environment, it is necessary to dramatically expand the rangeof analytes detected and to characterize the inorganic, organic, and biochemical compositionof the environment in a nonintrusive manner. Nanotechnology provides the opportunity toachieve this goal in at least two ways. First, nanoscale sensing elements can provide improvedsensitivity compared to conventional sensors.

Second, nanotechnology makes it possible to

create a massively parallel array of thousands or millions of high-sensitivity nanoscale sensingelements.

A massively parallel array of nanoscale sensor elements (such as chemically orbiologically modified carbon nanotubes or quantum dot structures) could be used in twodistinctly different modes. If each element were tailored to respond to a different stimulus(such as a specific chemical or biochemical analyte), then the array would provide simultaneousanalysis of a large number of analytes. If each element were tailored in an identical manner,then a massively parallel array of sensor sites would provide the ability to probe physical andchemical processes with very high spatial resolution, essentially becoming a high-resolutionimaging detector. The combination of high density nanoscale sensor arrays could be combinedwith advanced nanofluidics and artificial intelligence to create a reconfigurable, adaptablearray that could, for example, modify its specificity or function under changing conditions.From the standpoint of chemical and biochemical detection, massively parallel sensor arrayswould provide several important advances. For biological detection, the ability to measurerelatively weak biomolecular interactions, such as protein-binding events and antibody-antigen interactions, is crucial. However, biochemical responses often are dictated not byindividual molecules, but by combinations of molecules and combinations of processes. Ahigh-density array could be functionalized with a large number of molecular recognitionsites. By using smart data analysis software to look for patterns of response, such a sensorsystem would provide vastly improved selectivity, in a manner similar to that by which thehuman olfactory system can identify large numbers of odors using only a limited set ofreceptors.

Page 11: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

111Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

A massively parallel sensor would also yield improvements in sensitivity. Using an array ofnanoscale sensor elements separated by a distance greater than the diffusion length, it shouldbe possible to achieve significantly improved sensitivity for electrochemical detectioncompared with today’s planar sensor.

Nanoscale systems also can be expected to achieve higher sensitivity due to intrinsicquantum effects. For example, there has been a great deal of attention paid to single-moleculeor single-particle detection and single-electron transistors. In an analogous manner, achemically modified quantum dot or other nanoscale structure should be able to achievesingle-molecule sensitivity. A single molecule interacting with the quantum dot, in turn, canrelay the sensing signal in terms of an optical property or electrical conductivity.

As an imaging detector or sensor, a high-density array would provide greatly improvedspatial resolution for specific physical or chemical/biochemical phenomena. An array ofquantum dot structures could act as an array of optical detectors or even nanoscale opticalsources, providing a way to characterize the optical properties on nanometer-length scales.An array of biosensing elements may be able to examine the spatial distribution of biologicalmolecules within more complex, small structures (such as a cell or a microbial community).Imaging detectors may be particularly important for characterization of nanoparticles. Existingmethods for characterizing particulates are slow and do not provide good statisticalinformation about chemical composition and structure. It also should be possible to fabricatean array of field-emission electron sources and electron detectors that would provide a wayto image and perform electron-based spectroscopies, such as energy-loss spectroscopy withnanometer-scale spatial resolution. This would greatly aid characterization of individualnanoparticles present in complex environmental systems.

Nanoinformatics

The use of massively parallel arrays brings with it a need for the development of newcomputational methods for understanding signal transduction in nanoscale systems andanalyzing large amounts of data from nanoscale systems in real time. This may lead to aunique field of “nanoinformatics,” analogous to the explosion of interest in bioinformatics.There are several aspects of nanoinformatics applicable to nanotechnology and theenvironment. New challenges in understanding signal transduction and data analysis innanoscale arrays and networks, systems issues for deployment, communication networking,and software are areas of fundamental interest.The fabrication of thousands or millions of “perfect” sensing elements will be extremelydifficult or impossible. The need for perfection in the sensor “hardware” can be avoided by

Page 12: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

112Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

using advanced computer training and measurement methods. This approach essentiallyshifts the need for perfection from nanoscale hardware and places it on software. Such trainingis similar to a calibration process, but would be much more extensive and involve measuringsensor response to a wide range of stimuli, followed by mathematical analysis of the resultsto provide a well-defined stimulus response function. By using redundant sensor elementsand individualized training, the need for absolute perfection in each sensor array can beeliminated. Reducing the need for strict perfection of thousands or millions of sensingelements may be an important way to develop nanoscale arrays in a cost-effective manner.The data content provided by these arrays could be overwhelming, and smart electronicprocessing and decision making systems would need to be developed to convert these rawdata into meaningful information. In the case of sensor arrays, methods such as principalcomponent analysis, neural network analysis, or other linear and nonlinear methods wouldbe used to characterize the response of multiple chemical or biological receptor elementsand extract the identity and concentration of species present in the sample. Ideally, thesecomputer methods could provide optimized information content (such as chemicalcomposition, biological identity, or physical properties) on an as needed basis to an individualuser or as part of a real-time control system.

5. Scientific and Technological nfrastructure

Infrastructure needs include: (1) long-term (4-5 years) support for interdisciplinaryprojects; (2) increased support for tightly knit, small collaborative groups (3–4 PrincipalInvestigators) in focused areas of research; and (3) a mechanism for providing mid-sizedinstrumentation grants ($100K–$1M) to small groups without requiring matching funds.

6. R&D Investment and Implementation Strategy

Research NeedsThe research needs for developing high-density sensor arrays for in situ, real-time,

multiple-analyte quantification in the environment are categorized as follows.

Development of High-Density Sensor Probes

High-density sensor probes require the development of complex nanoscale moleculararchitectures and an understanding of their responses to chemical/biomolecular stimuli. Italso is necessary to develop an understanding of how to control, manipulate, and immobilizenanoscale materials to fabricate integrated sensor architectures. A greater understanding ofhow weak forces such as electrostatic forces, magnetic fields, hydrophobic-hydrophilic

Page 13: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

113Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

interactions, and hydrogen bonds affect and control the behavior of molecules and nanoscalesystems is needed. It also is essential to understand the collective properties of nanoscalematerials.

Research is needed to achieve optimal chemical and biological modifications on thesensor surface to permit specific and robust binding, and therefore, detection, for each analyte.Additionally, different sensing environments (e.g., air, water, and soil) can pose limitationson certain transduction mechanisms. To best suit different sensing environments, diversetransduction mechanisms such as electric, optic, magnetic, piezoelectric, magnetoresistive,and so on require exploration. Understanding how chemical and/or biomolecular interactionstranslate into optical, electrical, or other transduction signals is critical.

Integration With Silicon-Based Microelectronics

To achieve simultaneous detection of a large number of analytes, research on theintegration of sensor probes with silicon-based microfluidics and microelectronic circuitry isneeded. This will allow: (1) simultaneous data acquisition, (2) an easy computer interface forreal-time analysis, and (3) reduction in the sensor system’s size and increased potential forportable, hand-held devices.

Model Development, Analysis, and Validation

A thorough analysis must be performed to gain an understanding of the in situ detectionkinetics of each individual analyte. Such information will help quantify the concentration ofeach analyte. In addition, it is necessary to develop models that will help characterize andpredict the response of high-density, possibly nonlinear and potentially interacting sensorarrays in the presence of multiple analytes. The models must be incorporated in acommunication network to allow crosstalks and feedbacks, and integrated with smart andinteractive software for decision-making. Such models and software development will helpoptimize the specificity of the response produced by the analytes of interest. It also is criticalto develop, test, and validate protocols for calibrating the response of measurement systems.Research needs for a new generation of nanoparticle detectors include the development of:(1) low cost miniaturized instruments that integrate physical and compositionalcharacterization (such devices could be used to provide dosimetrics for personal and areamonitoring for both epidemiological and environmental monitoring needs); (2) instrumentsfor studying gas and liquid nanoparticle interactions (these studies are needed to unravel thecomplex interrelationships between nanoparticles, such as those generated by combustionsources and the natural environment); and (3) particle detectors and counters that improve

Page 14: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

114Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

the state-of-the-art in liquid samples to a level comparable with that for gas-dispersednanoparticle detection.

7. Examples of Recent Achievements and Paradigm Shifts

Inspiration for this vision derives from recently acquired understanding of theproperties of nanostructured materials which, in turn, is a consequence of advances insynthesis and characterization of nanostructures. Realization of the vision is dependent onthe development of libraries of ultrasensitive and selective sensing components and advancesin the fabrication of integrated parallel sensing and signal transduction devices. A samplingof breakthroughs that support the vision include demonstrations of: (1) molecular detectionusing biomolecule functionalized metal and semiconducting nanoparticles or bar-codedmetallic nanowires,

1 (2) pathogen detection using piezoelectric cantilevers,

(3) field and flow

controlled assembly of aligned carbon nanotubes,and (4) transport of electrical and opticalsignals through designed nanostructures.

Breakthroughs that speak to the environmental

impact of both natural and anthropogenic nano-structures include the recent demonstrationthat precise nanoscale dimensions play a critical role in determining particle transport inhuman tissue and, therefore, are key to understanding health effects. This breakthroughhighlights the importance of coupling studies of environmental interactions to nanostructuredmaterial research and device design.

8. References

[1] Kumar, M.S.; Lee, S.H.; Kim, T.Y.; Kim,T.H.; Song, S.M.; Yang, J.W.; Nahm, K.S.; Suh,E.K. DC electric field assisted alignment of carbon nanotubes on metal electrodes.Solid-State Electronics 2003, 47, 2075-2080.

[2] Huang, S.; Maynor, B.; Cai, X.; Liu, J. Ultra-long, well-aligned single-walled carbonnano-tube architectures on surfaces. Adv. Mater. 2003, 15, 1651-1655.

[3] Ural, A.; Li, Y.M.; Dai, H.J. Electric field-aligned growth of single-walled carbonnanotubes on surfaces. Appl. Phys. Lett. 2002, 81, 3464-3466.

[4] Huang, S.M.; Cai, X.Y.; Liu, J. Growth of millimeter-long and horizontally alignedsingle-walled carbon nanotubes on flat substrates. J. Am. Chem. Soc. 2003, 125,5636- 5637.

[5] Joselevich, E.; Lieber, C.M. Vectorial growth of metallic and semiconducting single-wall carbon nanotubes. Nano. Lett. 2002, 2,1137-1141.

Page 15: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

115Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

[6] Zhong, Z.H.; Wang, D.L.; Cui, Y.; Bockrath, M.W.; Lieber, C.M. Nanowire crossbararrays as address decoders for integrated nanosystems. Science 2003, 302, 1377-1379.

[7] Huang, Y.; Duan, X.F.; Cui, Y.; Lauhon, L.J.; Kim, K.H.; Lieber, C.M. Logic gates andcomputation from assembled nanowire building blocks. Science 2001, 294, 1313-1317.

[8] Cui, Y.; Lieber, C.M. Functional nanoscale electronic devices assembled using siliconnanowire building blocks. Science 2001, 291, 851-853.

[9] Huang, Y.; Duan, X.F.; Wei, Q.Q.; Lieber, C.M. Directed assembly of one-dimensionalnanostructures into functional networks. Science 2001, 291, 630-633.

[10] Messer, B.; Song, J.H.; Yang, P.D. Microchannel networks for nanowire patterning. J.Am. Chem.Soc. 2000, 122, 10232-10233.

[11] Nam, J.M.; Park, S.J.; Mirkin, C.A. Bio-barcodes based on oligonucleotide-modifiednanoparticles. J. Am. Chem. Soc. 2002, 124, 3820-3821.

[12] Chan, W.C.W.; Nie, S. Quantum dot bioconjugates for ultrasensitive nonisotopicdetection. Science 1998, 281, 2016-2018.

Page 16: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

Open Source Web Content Management SystemOnline Administration, Development &

Deployment

TAPAN KAPRIAssistant Professor, IT Department, Tecnia Institute of Advanced Studies, Delhi

Email ID: [email protected]

ABSTRACT

This paper investigates the utilization of Open Source Web ContentManagement System in the area of Online Administration, Development &Deployment based on web application. A website that started small and has becomequite large through the years may present some serious problems to the web developer.Such problems may not exist during the early years of the website’s existence. But asthey grow, such problems slowly crop up until such a time that they can becomeunmanageable and may affect the whole website’s usability. The problem of effectivelymanaging a website that was growing in size and complexity was the research focus.Determining efficient and sustainable processes to assure high quality onlinecommunications was the primary purpose of the research. Research questionsidentified the features and capabilities desired by website visitors and what processesand methods could be employed to assure accurate and timely content. Qualitymanagement issues surrounding distributed publishing of online content was alsoexplored. A questionnaire was used to determine desired website functionality. Aliterature review identified the workflow processes and methods available to assureaccurate and appropriate content. The research makes a case to deploy web contentmanagement systems online.

Keywords: Web Content Management, Website reusability, High Quality onlineCommunication, distributed publishing

Page 17: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

117Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

1. INTRODUCTION

A web content management system (WCMS) is a software system that provideswebsite authoring, collaboration, and administration tools designed to allow users withlittle knowledge of web programming languages or markup languages to create andmanage website content with relative ease. A robust WCMS provides the foundationfor collaboration, offering users the ability to manage documents and output for multipleauthor editing and participation.

A web content management system [2] is software that is used for creatingand managing website content. It is used to control a large, dynamic collection of webmaterial such as documents and their associated images. A web content managementsystem facilitates content creation, editing, workflow, and many essential webmaintenance functions. Prior to this system all website updates were submitted to asingle person that manually formatted the content. With the content managementsystem in place any authorized employee could submit and maintain content. Oncesubmitted, automated workflow routines routed content to the appropriate managersfor review and approval.

The purpose of this research is to identify efficient and sustainableorganizational processes to assure high quality online communications. Publishingaccurate, timely, and error-free content is critical because the site is an importantcommunity resource with a rapidly growing user base and because it is a reflection ofthe fire district’s commitment to customer service and quality.

To begin this research it is imperative to recognize who and why people visit thewebsite and what features and capabilities are desired by those users. Once quantifiedand understood, processes and methods can be identified to assure that content isaccurate and appropriate if page authoring and publication were decentralized.

2. Literature Review

To explore the developments in CMS area, thorough investigations were carriedout. Around thirty research papers and other sources related to CMS were studied. Abrief summary of the finding of that study is presented here. To present practicalexamples, various Web CMS were explored and few modules were developed using theWeb CMS like Joomla 1.5 and SharePoint 2010.

Earlier web CMS was based on mainly LAMP technology, But Now almost allsoftware companies trying to develop the Web CMS.

Page 18: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

118Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

Price [1] investigated the problem related to manage a fire department websitethat was growing in size and complexity. He presented the solution through determiningefficient and sustainable processes to assure high quality online communications. Healso presented a case to deploy enterprise web content management systems.Sol[4] Observed that on the web, content is king. Regardless of how your web site looks,once the content is in place, it is essential that time and energy is spent in thinkingabout how to present that content on the web. Like any medium, the web has its ownquirks and intricacies that make content distribution different from other mediums suchas print, radio, or television.

From Google news [5] various news tracked and presented about the latestresearch work worldwide in the field of web application relevant with web contentmanagement system.

After analyzing the previous developments in the area of Web CMS it was foundthat even after finding solutions for many problems, there exist certain problems likeunavailability of dynamic content management, Security issues, Complexity involved inrepositories management and administration related issues that need the attention ofresearchers as well as practitioners.These problems motivated to explore better solutions for them and in current papermajor concerns are about advancement of repository management, security as well asadministrative issues.

3. Background and Significance

A content management system (CMS) is the collection of procedures used tomanage work flow in a collaborative environment. These procedures can be manual orcomputer-based. The procedures are designed to do the following:• Allow for a large number of people to contribute to and share stored data• Control access to data, based on user roles (defining which information users or

user groups can view, edit, publish)• Aid in easy storage and retrieval of data• Reduce repetitive duplicate input• Improve the ease of report writing• Improve communication between users

In a CMS, data can be defined as nearly anything: documents, movies, pictures,phone numbers, scientific data, and so forth. CMSs are frequently used for storing,controlling, semantically enriching, and publishing documentation. Serving as a central

Page 19: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

119Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

repository, the CMS increases the version level of new updates to an already existing file.Version control is one of the primary advantages of a CMS.

4. Key Applications of Web CMS:

CMS product is one of the best Content management systems available on theweb. The online content management software is allows management of several websitesusing a single CMS. Our other services in CMS domain include

• Website content management systems• Custom CMS Software development & outsourcing• Web content management systems• Online CMS for ecommerce and medical CMS• Workflow and Template management• Enterprise Content management systems• Portal CMS development• .Net Content management system

5. Web Content Management Problems and Open Source Solutions

The open source community has produced a number of useful, high quality contentmanagement systems which presents an opportunity to deliver tailored contentmanagement solutions without the high licensing or management fees associated withcommercially-licensed or hosted software. However, the sheer number of open sourceCMS projects and the ineffectualness of traditional commercial software selectiontechniques can make the task of finding the right open source software an intimidatingchallenge. The strategy of using feature matrices is particularly ill-suited to open sourcesoftware selection. A more practical approach is to match your needs to a common businessproblem that others have solved using open source software and engage with thecommunity to learn about their experiences in implementing the solution.Doing so will take advantage of the unique aspects of open source software: the opennessof the user community and the transparency of the development process.

6. Technology Based Open Source Web Content Management Systems:

6.1 LAMP Technology : Joomla

Considered to be in the top three of any CMS list, by any CMS user. Joomla hasbeen used to build many corporate and professional websites, such as the United Nationspage.

Page 20: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

120Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

Figure 1: Admin Module of Joomla 1.5

6.2. Microsoft DOTNET Technology Web CMS : SharePoint

Microsoft SharePoint [3] is a web application that enables users within anorganization to work together, collaborate, more efficiently through its vast number offeatures.

6.3. Java Technology Open Source Web CMS: Magnolia and Serena SoftwareMost enterprise Web Content Management (WCM) solutions are built for a Java

application server. There are also Java-based WCM offerings for the small to mid-sizedenterprise (SME) market.

Figure 2: Admin Module SharePoint

Page 21: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

121Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

7. Web Content Management Latest News Worldwide

Limelight Buys Web Content Management System Clickability

paidContent.org -

Limelight Networks (NSDQ: LLNW), the publicly-traded content delivery network, hasbought up Click ability, which is behind a popular web content management system,

TERMINALFOUR’s Web Content Management Solution Adopted by University.SBWire (press release) ý

TERMINALFOUR was chosen following an international review of suitable web contentmanagement systems. UMass Dartmouth has been awarding undergraduate degrees, mastersand doctorates in a wide variety of specialties including business, arts and science.

Mobile is the Next Frontier for Web Content Management Technology

San Francisco Chronicle (press release)

Web content management vendors are making only halting moves towards mobile-enabling their applications and outputs, but they will apply significant resources to mobileservices over the next year, according to independent industry analyst firm.Content Management: “Extend the Productivity of Backoffice Systems”

Constructech - ýMay 5, 2011ý

IT is able to more effectively support the entire business while implementing a singlesolution for data management, archiving, and backup. Subcontractors and other third partiesare able to access information from the Web and get the answers they need.

8. Conclusion and Future direction

From the paper we conclude that A web content management system can provide anefficient and sustainable method to assure high quality web administration, developmentand online communication.

A web Content Management System is used to create, publish, and maintain contenton a Website.

Page 22: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

122Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

In short, a web application, which is used for managing the website templates and theweb content, easily and dynamically. Content management system gives you the tools tocreate and manage your Web site the way you want. Web CMS provides templates and formsto make it easy to add and update content on the site. Content management can help yourWeb site progress more rapidly. Web CMS manages the lifecycle of content from early stagethrough the approval and modification process.

To take advantage of the business opportunities offered by the Web, companiesnecessitate Content Management Systems that manage and deliver their Web presence withfast, accurate and easy selection.

While Content Management Systems are clearly capable of delivering significantbusiness remuneration to organizations with less flexible web site management technologies,the decision to purchase a CMS cannot be based simply on some theoretical value of benefitThe future Web CMS development will come up with improved technologies for

• Reuse of content• Quick content creation and publish without any time delay.• Integration of various internal applications – Integration• Improved corporate and client communication• Integrating external system and content - Content Aggregation & Syndication• Multiple access and transmission

9. References

[1] Price Richard, “Transitioning from A centralized Webmaster Model” A distributedWeb Content Management System”

http://www.usfa.dhs.gov/pdf/efop/efo41317.pdf,October,2007.[2] MSDN website, Web Content Management Overview, http://msdn.microsoft.com/en- us/library/ms580943(v=office.12).aspx , 2010[3] Bamboo Nation, Overview of Web Content Management in SharePoint 2010, http:/

/community.bamboosolutions.com/blogs/sharepoint- 2010/archive/2009/11/10/overview-of-web-content-management-in-sharepoint- 2010.aspx , June, 2010.

[4] Sol, S. (2007, April 18). Web Developer’s Virtual Library. What is awebmaster?http://www.wdvl.com/Internet/Web/Jobs/webmaster.html, September 21, 2007.

[5] Google news, “Web Content Management news”http://news.google.co.in/nwshp?hl=en&tab=wn&q=web%20content%20management%20news

Page 23: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

Web Use and Misuse Pramod Kumar Nayak,

Research Scholar, Manav Bharti University, SolanEmail ID: [email protected]

Ajai Pal Sharma,Research Scholar, School of Social Sciences,

IGNOU, DelhiEmail ID: [email protected]

ABSTRACT

The Information Technology has changed the atmosphere of communication ina wide and significant manner and it is on the continuous changing process. There israpid increasing rate of internet use and millions of web users are joining the internetzone. The usefulness of internet is commendable but simultaneously the misuse of thepersonal information is also in high rise. This paper aims at the issue of sensitivity ofpersonal information and its misuse over internet or web. The type of web activity byweb users and the sensitivity of personal information are highlighted in this paper.

Key Words: Cookies, Search Engine, Web, Spam, HTTP, URL, etc.

1. INTRODUCTION

In the modern age of science and technology computer has made a significantcontribution towards the development of communication system. The computer has agreatest impact on human life now days. Internet plays a significant role in this context.“The Internet is a worldwide network where different kinds of information can betransferred between computers. The Internet network is maintained by severalorganizations. WWW (web) is a globally interconnected network containing hypermediainformation. It consists of: the Internet (network), the document pointing system (URL),the protocol for transmission of hypermedia documents (HTTP) and a set of servers that

Page 24: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

124Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

respond to requests from browsers (or clients) for those hypermedia documents. TheWWW is a hypertext-based, distributed information system created by researchers atCERN (European Laboratory for Particle Physics) in Switzerland and France (project started1989)” (KERMINEN, KAI PETTERI, 1998).Since the use of Internet has changed its pace, millions of users are now connected toInternet in a 24X7 environment globally and getting the benefit out of it we can’t ignorethe fact of misuse of the web.

2. OBJECTIVE

The main objective of the paper is to address increasing levels of web misuse,and to identify the instances of web misuse and to find out the degree of sensitivity ofthe personal information.

3. THEORITICAL PERSPECTIVE

There are a number of studies carried out on the prevention of web misuse andthe most significant is the study by the Informatics Corporation, Canada and in theirpoint of view the personal information collected from the web users while they use theweb and uses them for their business benefits.Some of the e-Commerce activities which involve online surfing by the users of the web/internet are more vulnerable for identity theft or misuse. They are as follows:

• Online retailers who gather data on potential customers’ web activities and use itto target them with advertising.

• Online behavioral advertising typically uses information collected through“cookies”, files which are placed on a user’s computer after their first visit to thewebsite.

• Companies might use data collected on a person’s location, likely income, buyingpreferences or age to offer them prices that differ to those offered to othercustomers.

Source: Juxt India Online 2009 Study (Business World Marketing Whitebook 2010-11)

Total Daily Users (32.38) million

All Internet Users In India (46.49 millions) (100%)

Use less than once a month(Occasional Users)

Use at least once a month(Regular Users)

Urban Occasional Rural Occasional Urban Regular Rural Regular

Daily Urban Daily Rural

Page 25: Figure 6(a). Conceptual diagram illustrating vector quantization … · 2016-11-07 · GMM based Speaker Recognition. [9] Mahdi Shaneh, and Azizollah Taheri., Voice Command Recognition

125Proceedings of National Conference On Web and Knowledge Based Systems (WKBS)

There are 32.38 million daily internet users in India as per the study above. Thereis growing concern among web users about how private data is made public and thepossibility of misuse of personal information. The information collected is used for businessactivities and also being used for proxy representation of identity. The increasing numberof user in social networking sites and online business sites is making the WWW (WorldWide Web) a safe haven for the information seekers to gather the personal information.

4. RESEARCH DESIGN

The study of the Informatics Corporation, Canada has been taken as the base ofthe research and a sample of 200 potential web users of Delhi with varied demographylike (IT professional, Corporate Executives, Students, Salaried Employees, Businessmenetc.) has been taken by convenient sampling. A structured questionnaire being circulatedamong the respondents and they are asked to share their views and experiences on WebUse and Misuse.

5. ANALYSIS

The information gathered from the respondents were analyzed and the following outcomesare observed:

The outcomes of the study were divided into two main broad categories. What the webusers generally do over internet and what kind of web misuses they face over the internet.

The web user activity over internet

Type of activity on the web Percentage of Respondent (Total

Respondent = 200)

Used The Search Engine (like Google, Yahoo) 98

Sent/received an e-mail 90

Watched TV Shows, Movies and Sports 35

Online Shopping/Online Ticket booking 60

Listened MP3s 80

Download activity 95

Watched streamed Video on You Tube 25

Online transaction(Financial) 8

Social networking 50