2024-03-28T17:35:25Zhttp://oai-repositori.upf.edu/oai/requestoai:repositori.upf.edu:10230/169692018-01-24T08:14:40Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Hernández Leo, Davinia
author
Nieves, Raul
author
Arroyo, Ernesto
author
Rosales, Andrea
author
Melero Merino, Javier
author
Moreno, Pau
author
Blat, Josep
author
2011
The orchestration of collaborative learning processes in face-to-face/nphysical settings, such as classrooms, requires teachers to coordinate students indicating them who belong to each group, which collaboration areas are/nassigned to each group, and how they should distribute the resources or roles within the group. In this paper we present an Orchestration Signal system,/ncomposed of wearable Personal Signal devices and an Orchestration Signal manager. Teachers can configure color signals in the manager so that they are transmitted to the wearable devices to indicate different orchestration aspects./nIn particular, the paper describes how the system has been used to carry out a Jigsaw collaborative learning flow in a classroom where students received signals indicating which documents they should read, in which group they were and in which area of the classroom they were expected to collaborate. The evaluation results show that the proposed system facilitates a dynamic, visual and flexible orchestration.
Hernández-Leo D, Nieves R, Arroyo E, Rosales A, Melero J, Moreno P, Blat J. Orchestration signals in the classroom: managing the jigsaw collaborative learning flow. En: Towards Ubiquitous Learning 6th European Conference of Technology Enhanced Learning, EC-TEL 2011, Palermo, Italy, September 20-23, 2011. Proceedings. Heidelberg: Springer, 2011. p. 153-165. (LNCS, no. 6964). DOI: 10.1007/978-3-642-23985-4_13
0302-9743
http://hdl.handle.net/10230/16969
http://dx.doi.org/10.1007/978-3-642-23985-4_13
Orchestration signals in the classroom: managing the jigsaw collaborative learning flow
oai:repositori.upf.edu:10230/169702020-04-24T07:59:38Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Arroyo, Ernesto
author
Righi, Valeria
author
Tarrago, Roger
author
Santos Rodríguez, Patrícia
author
Hernández Leo, Davinia
author
Blat, Josep
author
2011
This paper presents a customizable system used to develop a collaborative multi-user problem solving game. It addresses the increasing demand for appealing informal learning experiences in museum-like settings. The system facilitates remote collaboration by allowing groups of learners to/ncommunicate through a videoconferencing system and by allowing them to simultaneously interact through a shared multi-touch interactive surface. A user study with 20 user groups indicates that the game facilitates collaboration between local and remote groups of learners. The videoconference and multitouch surface acted as communication channels, attracted students’ interest, facilitated engagement, and promoted inter- and intra-group collaboration—/nfavoring intra-group collaboration. Our findings suggest that augmenting/nvideoconferencing systems with a shared multitouch space offers new/npossibilities and scenarios for remote collaborative environments and collaborative learning.
Arroyo E, Righi V, Tarrago R, Santos P, Hernández-Leo D, Blat J. Remote collaborative multi-user informal learning experiences: design and evaluation. En: Towards Ubiquitous Learning 6th European Conference of Technology Enhanced Learning, EC-TEL 2011, Palermo, Italy, September 20-23, 2011. Proceedings. Heidelberg: Springer, 2011. p. 43-56. DOI: 10.1007/978-3-642-23985-4_5
http://hdl.handle.net/10230/16970
http://dx.doi.org/10.1007/978-3-642-23985-4_5
Remote collaborative multi-user informal learning experiences: design and evaluation
oai:repositori.upf.edu:10230/169922020-04-24T07:28:18Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Pérez Sanagustín, Maria del Mar
author
Burgos, J.
author
Hernández Leo, Davinia
author
Blat, Josep
author
2011
When applying a Collaborative Learning Flow Pattern (CLFP) to structure sequences of activities in real contexts, one of the tasks is to organize groups of students according to the constraints imposed by the pattern. Sometimes,/nunexpected events occurring at runtime force this pre-defined distribution to be changed. In such situations, an adjustment of the group structures to be adapted to the new context is needed. If the collaborative pattern is complex, this group redefinition/nmight be difficult and time consuming to be carried out in real time. In this context, technology can help on notifying the teacher which incompatibilities/nbetween the actual context and the constraints imposed by the pattern. This chapter presents a flexible solution for supporting teachers in the group organization profiting from the intrinsic constraints defined by a CLFPs codified in IMS Learning Design. A prototype of a web-based tool for the TAPPS and Jigsaw CLFPs and the preliminary results of a controlled user study are also/npresented as a first step towards flexible technological systems to support grouping tasks in this context.
Pérez-Sanagustín M, Burgos J, Hernández-Leo D, Blat J. CLFP intrinsic constraints-based group management of blended learning situations. En: Daradoumis T, Caballé S, Juan AA, Xhafa F, editors. Technology-Enhanced Systems and Tools for Collaborative Learning Scaffolding. Heidelberg: Springer Berlin; 2011 (Studies in Computational Intelligence; 350), p. 115-133. DOI: 10.1007/978-3-642-19814-4_6
978-3-642-19813-7
http://hdl.handle.net/10230/16992
http://dx.doi.org/10.1007/978-3-642-19814-4_6
CLFP intrinsic constraints-based group management of blended learning situations
oai:repositori.upf.edu:10230/169932020-04-24T07:39:39Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Chacón Pérez, Jonathan, 1986-
author
Hernández Leo, Davinia
author
Blat, Josep
author
2011
Collaborative activities, in which students actively interact with each other, have proved to provide significant learning benefits. In Computer-Supported Collaborative Learning (CSCL), these collaborative activities are assisted by technologies. However, the use of computers does not guarantee collaboration, as free collaboration does not necessary lead to fruitful learning. Therefore, practitioners need to design CSCL scripts that structure the collaborative settings so that they promote learning. However, not all teachers have the technical and pedagogical background needed to design such scripts. With the aim of assisting teachers in designing effective CSCL scripts, we propose a model to support the selection of reusable good practices (formulated as patterns) so that they can be used as a starting point for their own designs. This model is based on a pattern ontology that computationally represents the knowledge captured on a pattern language for the design of CSCL scripts. A preliminary evaluation of the proposed approach is provided with two examples based on a set of meaningful interrelated patters computationally represented with the pattern ontology, and a paper prototyping experience carried out with two teaches. The results offer interesting insights towards the implementation of the pattern ontology in software tools.
Chacón J, Hernández-Leo D, Blat J. From a pattern-language to a pattern ontology approach for CSCL script design. En: Murgante B, Gervasi O, Iglesias A, Taniar D, Apduhan B, editors. Computational Science and Its Applications - ICCSA 2011 International Conference, Santander, Spain, June 20-23, 2011. Proceedings, Part IV. Heidelberg: Springer Berlin; 2011 (Lecture Notes in Computer Science; 6785), p. 547-561. DOI: 10.1007/978-3-642-21898-9_45
978-3-642-21897-2
http://hdl.handle.net/10230/16993
http://dx.doi.org/10.1007/978-3-642-21898-9_45
From a pattern-language to a pattern ontology approach for CSCL script design
oai:repositori.upf.edu:10230/204372023-03-10T15:38:03Zcom_10230_5542com_10230_5922col_10230_8581col_10230_16968
00925njm 22002777a 4500
dc
Serra, Xavier
author
Rao, Preeti
author
Murthy, Hema A.
author
Bozkurt, Baris
author
2012
Serra X, Rao P, Murthy H, Bozkurt B, editors. Proceedings of the 2nd CompMusic Workshop; 2012 Jul 12-13; Istanbul, Turkey. Barcelona: Universitat Pompeu Fabra; 2012. 159 p.
978-84-695-4958-2
http://hdl.handle.net/10230/20437
Proceedings of the 2nd CompMusic Workshop
oai:repositori.upf.edu:10230/217662021-11-23T13:16:03Zcom_10230_5922com_10230_5542col_10230_16968col_10230_8581
00925njm 22002777a 4500
dc
Serra, Xavier
author
Magas, Michela
author
Benetos, Emmanouil
author
Chudy, Magdalena
author
Dixon, Simon
author
Flexer, Arthur
author
Gómez Gutiérrez, Emilia, 1975-
author
Gouyon, Fabien
author
Herrera Boyer, Perfecto, 1964-
author
Jordà Puig, Sergi
author
Paytuvi, Oscar
author
Peeters, Geoffroy
author
Schlüter, Jan
author
Vinet, Hugues
author
Widmer, Gerhard
author
2013
Serra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gómez E, Gouyon F, Herrera P, Jorda S, Paytuvi O, Peeters G, Schlüter J, Vinet H, Widmer G, Roadmap for Music Information ReSearch. Geoffroy Peeters (editor); 2013. 88 p. ISBN: 9782954035116.
9782954035116
http://hdl.handle.net/10230/21766
Roadmap for Music Information ReSearch
oai:repositori.upf.edu:10230/226552020-04-24T08:17:44Zcom_10230_5922com_10230_5542col_10230_16968col_10230_8581
00925njm 22002777a 4500
dc
Papiotis, Panagiotis, 1985-
author
Marchini, Marco, 1984-
author
Maestre Gómez, Esteban
author
Pérez Carrillo, Alfonso Antonio, 1977-
author
2012
In this article we present our ongoing work on expressive /nperformance analysis for violin and string ensembles, in terms of synchronization in intonation, timing, dynamics and articulation. Our current /nresearch objectives are outlined, along with an overview for the methods used /nto achieve them; finally, focusing on the case of intonation synchronization in /nviolin duets, some preliminary results and conclusions based on experimental /nrecordings are discussed.
Papiotis P, Marchini M, Maestre E, Perez A. Measuring ensemble synchrony through violin performance parameters: a preliminary progress report. In: Camurri A, Costa C, editors. Intelligent Technologies for Interactive Entertainment SE - 36. Springer Berlin Heidelberg; 2012. p. 267–72. DOI 10.1007/978-3-642-30214-5_36
http://hdl.handle.net/10230/22655
http://dx.doi.org/10.1007/978-3-642-30214-5_36
Measuring ensemble synchrony through violin performance parameters: a preliminary progress report
oai:repositori.upf.edu:10230/281462018-01-24T08:34:33Zcom_10230_5542com_10230_5922col_10230_8581col_10230_16968
00925njm 22002777a 4500
dc
Bertalmío, Marcelo
author
2014
Image Processing for Cinema presents a detailed overview of image processing techniques that are used in practice in digital cinema. The book shows how image processing has become ubiquitous in movie-making, from shooting to exhibition. It covers all the ways in which image processing algorithms are used to enhance, restore, adapt, and convert moving images. These techniques and algorithms make the images look as good as possible while exploiting the capabilities of cameras, projectors, and displays.
The author focuses on the ideas behind the methods, rather than proofs and derivations. The first part of the text presents fundamentals on optics and color. The second part explains how cameras work and details all the image processing algorithms that are applied in-camera. With an emphasis on state-of-the-art methods that are actually used in practice, the last part describes image processing algorithms that are applied offline to solve a variety of problems.
The book is designed for advanced undergraduate and graduate students in applied mathematics, image processing, computer science, and related fields. It is also suitable for academic researchers and professionals in the movie industry.
Bertalmío M. Image processing for cinema. 1st ed. Boca Raton: Taylor & Francis; 2014. Chapter 1, Light and color; p. 3-26.
http://hdl.handle.net/10230/28146
Light and color
oai:repositori.upf.edu:10230/321132018-01-24T08:14:55Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Hernández Leo, Davinia
author
Asensio-Pérez, Juan I.
author
Dimitriadis, Yannis
author
Villasclaras-Fernández, Eloy D.
author
2010
E-learning environments designed for computer-supported collaborative learning
(CSCL) mediate social interactions as key activators of learning). However, free
collaboration does not necessarily by itself produce the interactions we want.
Scaffolding collaboration can increase the probability of successful learning
outcomes (Fischer, Kollar, Mandl, & Haake, 2007). CSCL scripts embedded in
(Dillenbourg & Jermann, 2007), or interpreted by (Hernández-Leo et al., 2006c), elearning
environments aim to shape the way learners interact with each other to
elicit fruitful interactions.
The design of effective scripts is a non-trivial task that requires significant
expertise in, and knowledge of, the possibilities and risks of structuring
collaboration (Fischer et al., 2007). This chapter focuses on patterns as a way of
formulating and sharing experience regarding the design of potentially effective
scripted collaborative learning situations.
Design patterns capture reusable knowledge about a contextualized problem and
its associated, broadly accepted, solution. Patterns are decoupled when they are
applied, but they work together with other interconnected patterns to generate
emergent contextualized wholes. A pattern language (PL) embraces a set of
patterns relevant to a specific design space, together with the rules that link the
patterns together in meaningful ways, so that they provide guidance when creating
a space-related whole (Alexander et al., 1977).
In this chapter we aim to identify the types of patterns, and connections between
patterns, that can be used for generating CSCL scripts. These types of patterns and
relationships are formulated as a conceptual model (or meta-language) for
describing CSCL scripting PLs. That is to say, CSCL scripting is the design space
of the patterns and rules that can be situated in the proposed conceptual model. We
believe this model can provide the scientific community with a starting point for an
agreed high-level structure for the production of patterns and PLs that enable the
generation of CSCL scripts. Each institution or community of practice may have its
own patterns of effective scripted CL situations that typify that particular
community. We can foster the sharing and communication of good practice within
and between communities if such practice can be framed within the same
conceptual model (Goodyear, de Laat & Lally, 2006).
To illustrate the feasibility of this proposal, an Appendix to this chapter includes
a CSCL scripting PL (with our own and some adopted patterns) that can be
described with the conceptual model. The PL comprises 18 patterns; each pattern
documents its relationships to other patterns. The map of relationships sketches
many ways in which the patterns may be put together when creating different
CSCL scripts. Different patterns and connections of patterns may or may not apply,
depending on the context of a particular educational situation. Nevertheless, it is
important to point out that the PL is not complete as a set, in the sense that these
patterns cannot be used to generate any CSCL script. Each community can
augment the PL with its own patterns, or propose different ones (which might
borrow from some of the other patterns). This chapter also sets out a real scripted
CL situation generated using the proposed PL. The situation expresses and
illustrates the relationships between the patterns and shows how the diverse types
of CSCL scripting patterns can be applied.
We start by describing the methodology used to propose the conceptual model
and the PL.
Hernández-Leo D, Asensio-Pérez JI, Dimitriadis Y, Villasclaras-Fernández ED. Generating CSCL scripts: from a conceptual model of pattern languages to the design of real scripts. In: Goodyear P, Retalis S, eds. Technology-enhaced learning: design patterns and pattern languages. Rotterdam: Sense Publishers; 2010. p. 49-64, appendix.
9789460910609
http://hdl.handle.net/10230/32113
Generating CSCL scripts: from a conceptual model of pattern languages to the design of real scripts
oai:repositori.upf.edu:10230/328172020-04-24T08:49:10Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Pragst, Louisa
author
Ultes, Stefan
author
Minker, Wolfgang
author
2016
Getting a good estimation of the Interaction Quality (IQ) of a spoken dialogue helps to increase the user satisfaction as the dialogue strategy may be adapted accordingly. Therefore, some research has already been conducted in order to automatically estimate the Interaction Quality. This paper adds to this by describing how Recurrent Neural Networks may be used to estimate the Interaction Quality for each dialogue turn and by evaluating their performance on this task. Here, we will show that RNNs may outperform non-recurrent neural networks.
Pragst L, Ultes S, Minker W. Recurrent neural network interaction quality estimation. In: Jokinen K, Wilcock G, editors. Dialogues with social robots: enablements, analyses, and evaluation. Singapore: Springer; 2016. p. 381-93. (LNEE; no. 427). DOI: 10.1007/978-981-10-2585-3_31
http://hdl.handle.net/10230/32817
http://dx.doi.org/10.1007/978-981-10-2585-3_31
Recurrent neural network interaction quality estimation
oai:repositori.upf.edu:10230/332932021-06-10T10:43:24Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Morenza-Cinos, Marc
author
Casamayor Pujol, Víctor
author
Soler Busquets, Jordi
author
Sanz, José Luis
author
Guzmán, Roberto
author
Pous Andrés, Rafael
author
2017
AdvanRobot proposes a new robot for inventorying and locating all the products inside a retail store without the need of installing any fixed infrastructure. The patent pending robot combines a laser-guided autonomous robotic base with a Radio Frequency Identification (RFID) payload composed of several RFID readers and antennas, as well as a 3D camera. AdvanRobot is able not only to replace human operators, but to dramatically increase the efficiency and accuracy in providing inventory, while also adding the capacity to produce store maps and product location. Some important benefit of the inventory capabilities of AdvanRobot are the reduction in stock-outs, which can cause a drop in sales and are the most important source of frustration for customers; the reduction of the number of items per reference maximizing the number of references per square meter; and reducing the cost of capital due to over-stocking [1, 7]. Another important economic benefit expected from the inventorying and location capabilities of the robot is the ability to efficiently prepare on-line orders from the closest store to the customer, allowing retailers to compete with the likes of Amazon (a.k.a. omnichannel retail). Additionally, the robot enables to: produce a 3D model of the store; detect misplaced items; and assist customers and staff in finding products (wayfinding).
Morenza-Cinos M, Casamayor-Pujol V, Soler Busquets J, Sanz JL, Guzmán R, Pous R. Development of an RFID Inventory Robot (AdvanRobot). In: Koubaa A, editor. Robot Operating System (ROS). [Germany]: Springer; 2017. p. 387-417. DOI: 10.1007/978-3-319-54927-9_12
http://hdl.handle.net/10230/33293
http://dx.doi.org/10.1007/978-3-319-54927-9_12
Development of an RFID Inventory Robot (AdvanRobot)
oai:repositori.upf.edu:10230/334822018-01-24T08:03:29Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Deco, Gustavo
author
Jirsa, Viktor K.
author
Friston, Karl
author
2012
Global network dynamics over distributed brain areas emerge from the local
dynamics of each brain area. Conversely, global dynamics constrain local activity
such that the whole system becomes self-organizing. The implicit coupling
between local and global scales induces a form of circular causality that is characteristic
of complex, coupled systems that show self-organization, such as the
brain. Here we present a network model based on spiking neurons at the local
level and large-scale anatomic connectivity matrices at the global level. We demonstrate
that this multiscale network displays endogenous or autonomous
dynamics of the sort observed in resting-state studies. Our special focus here is
on the genesis of itinerant (wandering) dynamics and the role of multistable
attractors, which are involved in the generation of empirically known functional
connectivity patterns, if the global coupling causes the dynamics to operate in
the critical regime. Our results provide once again support for the hypothesis
that endogenous brain activity is critical.
Deco G, Jirsa V, Friston KJ. The dynamical and structural basis of brain activity. In: Rabinovich M, Friston KJ, Varona P, editors. Principles of brain dynamics: global state interactions. Cambridge: MIT Press; 2012. p. 9-25.
9780262017640
http://hdl.handle.net/10230/33482
The Dynamical and structural basis of brain activity
oai:repositori.upf.edu:10230/340112018-02-28T09:46:17Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Amatriain, Xavier
author
Bonada, Jordi, 1973-
author
Loscos, Àlex
author
Serra, Xavier
author
2002
In the context of this book, we are looking for representations of sound signals and signal processing systems that can give us ways to design sound transformations in a variety of music applications and contexts. It should have been clear throughout the book, that several points of view have to be considered, including a mathematical, thus objective perspective, and a cognitive, thus mainly subjective, standpoint. Both points of view are necessary to fully understand the concept of sound effects and to be able to use the described techniques in practical situations.
The mathematical and signal processing points of view are straightforward to present, which does not mean easy, since the language of the equations and of flow diagrams is suitable for them. However, the top-down implications are much harder to express due to the huge number of variables involved and to the inherent perceptual subjectivity of the music making process. This is clearly one of the main challenges of the book and the main reason for its existence.
The use of a spectral representation of a sound yields a perspective that is sometimes closer to the one used in a sound engineering approach. By understanding the basic concepts of frequency domain analysis, we are able to acquire the tools to use a large number of effects processors and to understand many types of sound transformations systems. Moreover, being the frequency domain analysis a somewhat similar process than the one performed by the human hearing system, it yields fairly intuitive intermediate representations.
The basic idea of spectral processing is that we can analyze a sound to obtain alternative frequency domain representations, which can then be transformed and inverted to produce new sounds. Most of the approaches start by developing an analysis/synthesis system from which the input sound is reconstructed without any perceptual loss of sound quality. The techniques described in chapter 8 are clear examples of this approach. Then the main issue is what is the intermediate representation and what parameters are available for applying the desired transformations.
Perceptual or musical concepts such as timbre or pitch are clearly related to the spectral characteristics of a sound. Even some common processes for sound effects are better explained using a frequency domain representation. We usually think on the frequency axis when we talk about equalizing, filtering, pitch shifting, harmonizing... In fact, some of them are specific to this signal processing approach and do not have an immediate counterpart on the time domain. On the other hand, most (but not all) of the sound effects presented in this book can be implemented in the frequency domain.
Another issue is whether or not this approach is the most efficient, or practical, for a given application. The process of transforming a time domain signal into a frequency domain representation is, by itself, not an immediate step. Some parameters are difficult to adjust and force us to take several compromises. Some settings, such as the size of the analysis window, have little or nothing to do with the high-level approach we intend to favor, and require the user to have a basic signal processing understanding.
In that sense, when we talk about higher level spectral processing we are thinking of an intermediate analysis step in which relevant features are extracted, or computed, from the spectrum. These relevant features should be much closer to a musical or high-level approach. We can then process the features themselves or even apply transformations that keep some of the features unchanged. For example, we can extract the fundamental frequency and the spectral shape from a sound and then modify the fundamental frequency without affecting the shape of the spectrum.
Assuming the fact that there is no single representation and processing system optimal for everything, our approach will be to present a set of complementary spectral models that can be combined to be used for the largest possible set of sounds and musical applications.
Amatriain X, Bonada J, Loscos A, Serra X. Spectral processing. In: Zölzer U, editor. DAFX - Digital Audio Effects. Chichester: John Wiley & Sons; 2002. p. 373-438 DOI: 10.1002/9781119991298.ch10
978-0-470-66599-2
http://hdl.handle.net/10230/34011
http://dx.doi.org/10.1002/9781119991298.ch10
Spectral Processing
oai:repositori.upf.edu:10230/340602018-03-08T02:31:04Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Serra, Xavier
author
Leman, Marc
author
Widmer, Gerhard
author
2007
Music is an important aspect of all human cultures. Music is meant to give new experiences, to give sense and meaning to life, to console and to promote social coherence and personal identity in and across very diverse social and ethnic groups. Rooted in the biology of every human being, music is a core occupation of our technological society. By 2020, music will have become a commodity as ubiquitous as water or electricity. Its content and the activities surrounding it will promote new business ventures, which in turn will bolster the music and cultural/creative industries. Sound and Music Computing (SMC) will provide the core technologies for this ongoing revolution in electronic music culture. Its major research contribution to advances in the field will be to bridge the semantic gap, the hiatus that currently separates sound from sense. This contribution will stimulate fruitful interaction between culture, science and industry.
Serra X, Leman M, Widmer G, editors. A Roadmap for sound and music computing. [place unknown]: The S2S2 Consortium; 2007. 167 p.
http://hdl.handle.net/10230/34060
A Roadmap for sound and music computing
oai:repositori.upf.edu:10230/340722023-04-05T11:52:54Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Serra, Xavier
author
1989
This dissertation introduces a new analysis/synthesis method. It is designed to obtain musically useful intermediate representations for sound transformations. The method’s underlying model assumes that a sound is composed of a deterministic component plus a stochastic one. The deterministic component is represented by a series of sinusoids that are described by amplitude and frequency functions. The stochastic component is represented by a series of magnitude-spectrum envelopes that function as a time-varying filter excited by white noise. Together these representations make it possible for a synthesized sound to attain all the perceptual characteristics of the original sound. At the same time the representation is easily modified to create a wide variety of new sounds. This analysis/synthesis technique is based on the short-time Fourier transform (STFT). From the set of spectra returned by the STFT, the relevant peaks of each spectrum are detected and used as breakpoints in a set of frequency trajectories. The deterministic signal is obtained by synthesizing a sinusoid from each trajectory. Then, in order to obtain the stochastic component, a set of spectra of the deterministic component is computed, and these spectra are subtracted from the spectra of the original sound. The resulting spectral residuals are approximated by a series of envelopes, from which the stochastic signal is generated by performing an inverse-STFT. The result is a method that is appropriate for the manipulation of sounds. The intermediate representation is very flexible and musically useful in that it offers unlimited possibilities for transformation.
Serra X. A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition [dissertation]. [Stanford]: Stanford University; 1989. 166 p.
http://hdl.handle.net/10230/34072
A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition
oai:repositori.upf.edu:10230/342862020-06-11T09:42:13Zcom_10230_5542com_10230_5922col_10230_8581col_10230_16968
00925njm 22002777a 4500
dc
Gouyon, Fabien
author
Herrera Boyer, Perfecto, 1964-
author
Gómez Gutiérrez, Emilia, 1975-
author
Cano Vila, Pedro
author
Bonada, Jordi, 1973-
author
Loscos, Àlex
author
Amatriain, Xavier
author
Serra, Xavier
author
2008
In this chapter, we provide an overview of state-of-the-art algorithms for the automatic description of music audio signals, both from a low-level perspective (focusing on signal characteristics) and a more musical perspective (focusing on musically meaningful dimensions). We also provide examples of applications based on this description, such as music identification, music browsing and music signal transformations. Throughout the chapter, a special focus is put on promising research directions.
Gouyon F, Herrera P, Gómez E, Cano P, Bonada J, Loscos A, Amatriain X, Serra X. Content processing of music audio signals. In: Polotti P, Rocchesso D, editors. Sound to sense, sense to sound: a state of the art in sound and music computing. Berlin: Logos Verlag; 2008. p. 83-160.
978-3-8325-1600-0
http://hdl.handle.net/10230/34286
Content processing of music audio signals
oai:repositori.upf.edu:10230/356552018-10-26T01:30:38Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Parés, Narcís, 1966-
author
Carreras, Anna
author
2009
In this chapter we present the design process followed for an interactive experience in a museum installation for children of age 14 to 18. The experience wishes to communicate a set of abstract concepts through full-body interaction following the interaction-driven design strategy. We also present a design framework that we have derived from the design process of this and previous full-body interactive experiences, in an attempt to generalize the steps we have followed. This framework is based on five levels, namely: conceptual, symbolic, semantic, user attitude, and user action levels. We believe this will allow designers to achieve experiences that better communicate abstract concepts and notions through interaction itself by making the users “live” the experiences in their own flesh through full-body interaction.
Carreras A, Parés N. Designing an interactive installation for children to experience abstract concepts. In: Macías JA, Granollers A, Latorre PM, editors. New trends on human-computer interaction: Research, development, new tools and methods. New York: Springer; 2009. p. 33-42. DOI: 10.1007/978-1-84882-352-5_4
http://hdl.handle.net/10230/35655
http://dx.doi.org/10.1007/978-1-84882-352-5_4
Designing an interactive installation for children to experience abstract concepts
oai:repositori.upf.edu:10230/356742018-10-31T15:07:16Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Farrús, Mireia
author
2010
Farrús M. Prosody in automatic speaker recognition: applications in biometrics and voice imitation. Saarbrücken, Germany: VDM Verlag Dr. Müller; 2010. 216 p.
9783639089912
http://hdl.handle.net/10230/35674
Prosody in automatic speaker recognition: applications in biometrics and voice imitation
oai:repositori.upf.edu:10230/449092020-06-08T07:51:59Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Herrera Boyer, Perfecto, 1964-
author
2009
Herrera Boyer P. Música y persuasión. En: La música y su reflejo en la sociedad. Barcelona: Indigestió Musical; 2009. p. 27-38. (Idees per la música; no. 4).
978-84-613-2014-1
http://hdl.handle.net/10230/44909
Música y persuasión
oai:repositori.upf.edu:10230/449182020-06-09T01:30:30Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Herrera Boyer, Perfecto, 1964-
author
2009
Herrera Boyer P. Música i persuasió. Dins: La música i el seu reflex en la societat. Barcelona: Indigestió Musical; 2009. p. 27-38. (Idees per la música; no. 4).
978-84-613-2013-4
http://hdl.handle.net/10230/44918
Música i persuasió
oai:repositori.upf.edu:10230/453132020-09-19T01:30:49Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Serra, Xavier
author
2002
Serra X. The Musical communication chain and its modeling. In: Assayag G, Fechtinger HG, Rodrigues JF, editors. Mathematics and music: a Diderot mathematical forum. Berlin: Sprimger-Verlag; 2002. p. 243-55.
978-3-540-43727-7
http://hdl.handle.net/10230/45313
The Musical communication chain and its modeling
oai:repositori.upf.edu:10230/456272021-04-08T08:05:44Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Duran Cals, Jordi
author
Villarejo Muñoz, Luis
author
Farrús, Mireia
author
Ortiz, Sergio
author
Ramírez, Gemma
author
2013
The Universitat Oberta de Catalunya (Open University of Catalonia, UOC), is a public university based in Barcelona. The UOC is characterised by three main factors: (a) it is a virtual university based in an e-Learning model, (b) it is based in a strongly Spanish-Catalan bilingual region, and (c) students come from around the world, so that linguistic and cultural diversity is a crucial factor.
Within this context, it becomes essential to meet the UOC’s linguistic needs taking into account its particular characteristics. One of the tools created to this end is the adaptation of Apertium, a free/open-source rule-based machine translation platform, which can be found under http://apertium.uoc.edu/, customised to the translation needs of the institution in order to offer the best possible service to their user community.
In order to continue adapting and adding value to the existing tool for generalisable large-scale applications, the UOC’s translation system has recently implemented a semantic filter based on subject fields aimed at improving the translation quality and at better fitting the university needs. The paper will explain all the steps of this adaptive process, as well as a demonstration of the resulting tool: (a) the choice of the subject fields according to the university studies, (b) the design and implementation of the dictionaries used to extract the required information to filter and disambiguate homonym and polysemous terms, including source code in the dictionaries, and (c) the design and implementation of the corresponding web interface.
Duran J, Villarejo L, Farrús M, Ortiz S, Ramírez G. Incorporating subject areas into the Apertium machine translation system. In: Przepiórkowski A, Piasecki M, Jassem K, Fuglewicz P, editors. Computational linguistics: Applications. Berlin: Springer; 2013. p. 281-92. DOI: 10.1007/978-3-642-34399-5_15
978-3-642-34399-5
http://hdl.handle.net/10230/45627
http://dx.doi.org/10.1007/978-3-642-34399-5_15
Incorporating subject areas into the Apertium machine translation system
oai:repositori.upf.edu:10230/457762020-11-17T02:31:03Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Serra, Xavier
author
1997
When generating musical sound on a digital computer, it is important to have a good model whose parameters provide a rich source of meaningful sound transformations. Three basic model types are in prevalent use today for musical sound generation instrument models, spectrum models, and abstract models. Instrument models attempt to parametrize a sound at its source, such as a violin, clarinet, or vocal tract. Spectrum models attempt to parametrize a sound at the basilar membrane of the ear, discarding whatever information the ear seems to discard in the spectrum. Abstract models, such as FM, attempt to provide musically useful parameters in an abstract formula.
This article addresses the second category of synthesis techniques spectrum modeling. The main advantage of this group of techniques is the existence of analysis procedures that extract the synthesis parameters out of real sounds, thus being able to reproduce and modify actual sounds. Our particular approach is based on modeling sounds as stable sinusoids (partials) plus noise (residual component), therefore analyzing sounds with this model and generating new sounds from the analyzed data. The analysis procedure detects partials by studying the time-varying spectral characteristics of a sound and represents them with time-varying sinusoids. These partials are then subtracted from the original sound and the remaining "residual" is represented as a time-varying filtered white noise component. The synthesis procedure is a combination of additive synthesis for the sinusoidal part, and subtractive synthesis for the noise part.
Serra X. Musical sound modeling with sinusoids plus noise. In: Roads C, Pope S, Picialli A, de Poli G, editors. Musical signal processing. Lisse: Swets & Zeitlinger; 1997. p. 91-122.
9781315078120
http://hdl.handle.net/10230/45776
Musical sound modeling with sinusoids plus noise
oai:repositori.upf.edu:10230/457862020-11-18T02:30:40Zcom_10230_5542com_10230_5922col_10230_8581col_10230_16968
00925njm 22002777a 4500
dc
Llorach, Gerard
author
Agenjo, Javier
author
Blat, Josep
author
Sayago Barrantes, Sergio
author
2019
Within Human-Computer Interaction, there has recently been an important turn to embodied and voice-based interaction. In this chapter, we discuss our ongoing research on building online Embodied Conversational Agents (ECAs), specifically, their interactive 3D web graphics aspects. We present ECAs based on our technological pipeline, which integrates a number of free online editors, such as Adobe Fuse CC or MakeHuman, and standards, mainly BML (Behaviour Markup Language). We claim that making embodiment available for online ECAs is attainable, and advantageous over current alternatives, mostly desktop-based. In this chapter we also report on initial results of activities aimed to explore the physical appearance of ECAs for older people. A group of them (N = 14) designed female ECAs. Designing them was easy and great fun. The perspective on older-adult HCI introduced in this chapter is mostly technological, allowing for rapid online experimentations to address key issues, such as anthropomorphic aspects, in the design of ECAs with, and for, older people.
Llorach G, Agenjo J, Blat J, Sayago S. Web-based embodied conversational agents and older people. In: Sayago S, editor. Perspectives on human-computer interaction research with older people. Cham: Springer International Publishing; 2019. p. 119-35. DOI: 10.1007/978-3-030-06076-3_8
http://hdl.handle.net/10230/45786
http://dx.doi.org/10.1007/978-3-030-06076-3_8
Web-based embodied conversational agents and older people
oai:repositori.upf.edu:10230/468642023-07-03T10:20:16Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Vila-Vidal, Manel, 1991-
author
Tauste Campo, Adrià, 1982-
author
2023
This chapter offers an overview of core topics in epilepsy research from a complex systems perspective, starting from single-node studies up to network theory analysis. In its first part, the chapter reviews the advances in EEG biomarkers for epileptic network mapping in both invasive and scalp EEG, including univariate epileptogenic indexes linear connectivity measures within the multivariate autoregressive model, non-linear correlation measures and graph-theoretical properties. In the second part, the chapter covers some of the initial contributions to characterize network dynamics in epilepsy. Particular attention is paid at the seizure prediction problem, using classic dynamical systems approaches as well as the most recent machine-learning prediction algorithms and network-science studies unraveling signatures of the transition from interictal to ictal activity.
Vila-Vidal M, Tauste Campo A. Mapping epileptic networks with scalp and invasive EEG: applications to epileptogenic zone localization and seizure prediction. In: Scott R, Mahoney M, editors. A Complex Systems Approach to Epilepsy: Concept, Practice, and Therapy. Cambridge: Cambridge University Press; 2023. p. 99-126. DOI: 10.1017/9781108582285.008
http://hdl.handle.net/10230/46864
http://dx.doi.org/10.1017/9781108582285.008
Mapping epileptic networks with scalp and invasive EEG: applications to epileptogenic zone localization and seizure prediction
oai:repositori.upf.edu:10230/523912022-02-03T02:33:13Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Sánchez Reina, Jesús Roberto
author
Theophilou, Emily
author
Hernández Leo, Davinia
author
Medina Bravo, Pilar, 1966-
author
2021
Sánchez-Reina JR, Theophilou E, Hernández-Leo D, Medina-Bravo P. The power of beauty or the tyranny of algorithms. How do teens understand body image on instagram? En: Castillo-Abdul B, García-Prieto V, editors. Prosumidores emergentes: redes sociales, alfabetización y creación de contenidos. Sevilla; Editorial Dykinson S.L.; 2021. p. 429-50.
http://hdl.handle.net/10230/52391
The power of beauty or the tyranny of algorithms. How do teens understand body image on instagram?
oai:repositori.upf.edu:10230/537292022-07-15T01:32:21Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Rodríguez, Juan Antonio
author
Comas, Joaquim
author
Binefa i Valls, Xavier
author
2021
Virtual learning and education have become crucial during the COVID19 pandemic, which has forced a rethink by teachers and educators into designing
online content and the indirect interaction with students. In an face-to-face class,
some visual cues help the teacher recognize the engagement level of students, while
the main weakness of the online approach is the lack of feedback that the teacher
has about the learning process of the students. In this paper, we introduce a novel
framework able to track the learning states, or LS, of the students while they are
watching a piece of knowledge-based content. Specifically, we extract four learning states: Interested, Bored, Confused or Distracted. Finally, to demonstrate the
system’s capability, we collected a reduced database to analyze the affective state
of the subjects. From these preliminary results, we observe abrupt changes in the
LS of the audience when there are abrupt changes in the narrative of the video, indicating that well-structured and bounded information is strongly related with the
learning behaviour of the students.
Rodríguez JA, Comas J, Binefa X. Affective state-based framework for e-learning systems. In: Villaret M, Alsinet T, Fernández C, Valls A, editors. Artificial intelligence research and development: proceedings of the 23rd International Conference of the Catalan Association for Artificial Intelligence. Amsterdam: IOS Press; 2021. p. 357-66. DOI: 10.3233/FAIA210155
http://hdl.handle.net/10230/53729
http://doi.org/10.3233/FAIA210155
Affective state-based framework for e-learning systems
oai:repositori.upf.edu:10230/564962023-04-20T01:31:16Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Hernández Leo, Davinia
author
Amarasinghe, Ishari
author
Beardsley, Marc
author
Hakami, Eyad
author
Ruiz Garcia, Aurelio
author
Santos Rodríguez, Patrícia
author
2023
This chapter unfolds some elements of responsible research in the educational technology field and provides examples about how these elements have been considered in initiatives by the Interactive and Distributed Technologies for Education (TIDE) research group at Universitat Pompeu Fabra in Barcelona. First, it focuses on open science, an ongoing movement that promotes, on the one hand, transparent and frequent open-access updates of the research progress and the collected data and, on the other hand, reproducible, accurate, and verifiable research, bringing benefits for the individual researchers, the research community, and the society. Second, the chapter discusses ethics perspectives in educational technology research, relevant when collecting and sharing data and also in the design and development of technologies, especially when they are based on data analytics or artificial intelligence techniques. The latter aspects relate to the capacity of educational software systems to support human agency and preserve human well-being.
Hernández-Leo D, Amarasinghe I, Beardsley M, Hakami E, Ruiz García A, Santos P. Responsible educational technology research: from open science and open data to ethics and trustworthy learning analytics. In: Raffaghelli JE, Sangrà A, editors. Data cultures in higher education. Cham: Springer; 2023. p. 179–200. DOI: 10.1007/978-3-031-24193-2_7
978-3-031-24192-5
1571-0378
http://hdl.handle.net/10230/56496
http://dx.doi.org/10.1007/978-3-031-24193-2_7
Responsible educational technology research: from open science and open data to ethics and trustworthy learning analytics
oai:repositori.upf.edu:10230/589752024-02-08T02:30:41Zcom_10230_5922com_10230_5542col_10230_16968
00925njm 22002777a 4500
dc
Hernández Leo, Davinia
author
2022
Hernández-Leo D. Directions for the responsible design and use of AI by children and their communities: examples in the field of Education. In: Charisi V, Chaudron S, Di Gioia R, Vuorikari R, Escobar Planas M, Sanchez Martin JI, Gomez Gutierrez E. Artificial intelligence and the rights of the child: towards an integrated agenda for research and policy. Sevilla: European Union; 2022. p. 73-4. DOI: 10.2760/012329
9789276518372
http://hdl.handle.net/10230/58975
http://dx.doi.org/10.2760/012329
Directions for the responsible design and use of AI by children and their communities: examples in the field of Education