Lecture Notes in Computer Science
Lecture Notes in Computer Science
Language Representation
1 Introduction
Quantum mechanics is a very successful scientific theory for making pre-
dictions about systems with inherent ambiguity in them. That natural
language bears similarities with such a system is at least plausible. Re-
cent advances in theory and experimentation to apply quantum mechanics
to non-quantum domains include the use of quantum algorithms to ad-
dress, or to more efficiently solve, problems in such domains (including
contrasts between classical vs. quantum methods), such as applications
of artificial intelligence, information retrieval, and language modelling.
The quantum metaphor promises improved methodologies to capture
the subtleties and ambiguities of human language, resulting in optimised
algorithms for text processing. The purpose of SQUALAR is to investigate
methods borrowed from the field of quantum mechanics in a wide range of
large-scale language technology applications by seeking a match between
quantum algorithms and heterogeneous computing.
To this end, a scalable environment is a must. Latest trends indicate
the rise of a heterogeneous platform in which multi-core central processing
units (CPUs) and graphics processing units (GPUs) work together in a
distributed-memory parallelism. CPU-based parallelism has been utilized
for decades, and while not without its own problems, it is a mature field
and multicore CPUs enable developing faster algorithms with reasonable
effort. In this paradigm, there is a considerable overhead on dividing the
problem, distributing the bits along a small number of CPU cores, then
collecting and merging results. This type of parallelism is available in a
wide range of programming languages, although the source code needs to
be modified to some extent. GPU-based parallelism is a completely differ-
ent approach. The overhead of splitting the work is minimal, the number
of cores is massive, but the kind of computations that can be split is
limited to a simple, single-pass operation. This heterogeneous computing
environment has to be studied at different levels to find scalable imple-
mentations: low-level linear algebra, numerical methods, kernel methods
and manifold learning are candidates for testing, as well as higher level
load distribution such as MapReduce [1]. The constraints are as follows:
2 Heterogeneous computing
With the above plethora of approaches available for testing, the funda-
mental task of SQUALAR is bridging scalable linear algebra and nu-
merical methods that are widely used in scientific computing with the
emerging theories in quantum interaction to enable practical, real-world
language technology applications.
The hardware and basic software infrastructure is what we described
in section 2: a distributed system consisting of heterogeneous nodes which
combine multicore CPUs and GPUs (top part of Figure 1). Since hardware
virtualization is already at consumer level, the distributed system can be
either a privately owned cluster or grid, or a high-performance computing
cloud provided by a third-party.
Without going into details, algorithms in linear algebra are the most
obvious candidates for acceleration on graphics hardware (middle part of
Figure 1, left). Vector space models of semantics can be implemented by
accelerated BLAS libraries [7, 9], including operator algebra for semantic
inference [22, 24]. Matrix decompositions and dimension reduction that
also play an important role in understanding semantics are currently lim-
ited to matrices of limited sizes [10]. Convolution, which plays an im-
portant part in encoding term positions [29, 35], can be mapped to the
frequency domain by Fourier transformation, where the operation simpli-
fies to a simple multiplication. Fast Fourier transformation on GPUs is
a classical area for acceleration [55]. More complex examples in acceler-
ated quantum methods [56, 57] and related visualization [58] are awaiting
appropriate metaphors in language processing.
Approaching from existing language processing algorithms, if a suf-
ficient metaphor cannot be found or if the method does not lend itself
General-purpose
Parallel Distributed
computing on
programming on systems: grids
graphics
multicore CPUs and clouds
processing units
Computing environment
Supporting Technologies
Applications
easily to any of the methods described above, lower level libraries can be
used for developing multithreaded, GPU-based implementations (middle
part of Figure 1, right and middle).
If we focus on a single computer, we will be able to perform oper-
ations several folds faster, gaining new insights on language technology
(bottom part of Figure 1, left). By providing a high-level load balancing
mechanism, the potential of compute and data-intensive processing can
be released in a distributed environment for web-scale applications (bot-
tom part of Figure 1, middle). Some machine learning algorithms, such
as support vector machines, have already been adopted to graphics hard-
ware [59]. Combining these with the above, we gain powerful text mining
applications (bottom part of Figure 1, right). Since Information Retrieval
has already began experimenting with a wide range of quantum theory
based metaphors, this field has the most to benefit.
5 Conclusion
6 Acknowledgement
References
1. Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters.
In: Proceedings of OSDI-04, 6th International Symposium on Operating Systems
Design & Implementation, San Francisco, CA, USA, ACM Press, New York, NY,
USA (2004)
2. Lin, J., Dyer, C.: Data-Intensive Text Processing with MapReduce. Morgan &
Claypool (2010)
3. Cavanagh, J., Potok, T., Cui, X.: Parallel latent semantic analysis using a graphics
processing unit. In: Proceedings of GECCO-09, 11th Annual Conference Compan-
ion on Genetic and Evolutionary Computation Conference: Late Breaking Papers,
Montreal, QC, Canada, ACM Press, New York, NY, USA (2009) 2505–2510
4. Ding, S., He, J., Yan, H., Suel, T.: Using graphics processors for high performance
IR query processing. In: Proceedings of WWW-09, 18th International Conference
on World Wide Web, Spain, Madrid, ACM Press, New York, NY, USA (2009)
421–430
5. Zhang, Y., Mueller, F., Cui, X., Potok, T.: Large-scale multi-dimensional docu-
ment clustering on GPU clusters. In: Proceedings of IDPDS-10, 24th International
Parallel and Distributed Computing Symposium, Atlanta, GA, USA, IEEE Com-
puter Society (2010)
6. Byna, S., Meng, J., Raghunathan, A., Chakradhar, S., Cadambi, S.: Best-effort
semantic document search on GPUs. In: Proceedings of GPGPU-10, 3rd Workshop
on General-Purpose Computation on Graphics Processing Units, New York, NY,
USA, ACM (2010) 86–93
7. Krüger, J., Westermann, R.: Linear algebra operators for GPU implementation
of numerical algorithms. In: Proceedings of SIGGRAPH-05, 32nd International
Conference on Computer Graphics and Interactive Techniques, Los Angeles, CA,
USA, ACM Press, New York, NY, USA (2005) 234–242
8. Galoppo, N., Govindaraju, N., Henson, M., Bondhugula, V., Larsen, S., Manocha,
D.: Efficient numerical algorithms on graphics hardware. In: Proceedings of EDGE-
06, Workshop on Edge Computing Using New Commodity Architectures, Chapel
Hill, NC, USA (2006)
9. Barrachina, S., Castillo, M., Igual, F., Mayo, R., Quintana-Orti, E.: Evaluation
and tuning of the level 3 CUBLAS for graphics processors. In: Proceedings of
IPDPS-08, 22nd International Symposium on Parallel and Distributed Processing,
Miami, FL, USA, IEEE (2008) 1–8
10. Lahabar, S., Narayanan, P.: Singular value decomposition on GPU using CUDA.
In: Proceedings of IPDPS-09, 23rd International Symposium on Parallel and Dis-
tributed Processing, Rome, Italy, IEEE (2009)
11. Brodtkorb, A., Dyken, C., Hagen, T., Hjelmervik, J., Storaasli, O.: State-of-the-art
in heterogeneous computing. Scientific Programming 18(1) (2010) 1–33
12. Kirk, D., Hwu, W.: Programming massively parallel processors: A hands-on ap-
proach. (2009)
13. Jiménez, V., Vilanova, L., Gelado, I., Gil, M., Fursin, G., Navarro, N.: Predic-
tive runtime code scheduling for heterogeneous architectures. High Performance
Embedded Architectures and Compilers (2009) 19–33
14. Lee, S., Min, S., Eigenmann, R.: OpenMP to GPGPU: a compiler framework
for automatic translation and optimization. In: Proceedings of PPOPP-09, 14th
Symposium on Principles and Practice of Parallel Programming, ACM Press, New
York, NY, USA (2009) 101–110
15. Luk, C., Hong, S., Kim, H.: Qilin: Exploiting parallelism on heterogeneous multi-
processors with adaptive mapping. In: MICRO-42, 42nd Annual IEEE/ACM In-
ternational Symposium on Microarchitecture, New York, NY, USA, IEEE (2009)
45–55
16. Phillips, J., Stone, J., Schulten, K.: Adapting a message-driven parallel applica-
tion to GPU-accelerated clusters. In: Proceedings of SC-08, 21st Conference on
Supercomputing, Austin, TX, USA, IEEE Press (2008) 1–9
17. Kuhn, B., Petersen, P., O’Toole, E.: OpenMP versus threading in C/C++. Con-
currency: Practice and Experience 12(12) (2000) 1165–1176
18. Koop, M., Sur, S., Gao, Q., Panda, D.: High performance MPI design using unre-
liable datagram for ultra-scale InfiniBand clusters. In: Proceedings of ISC-06, 21st
Annual International Conference on Supercomputing, Dresden, Germany, ACM
(2006) 180–189
19. : NVida Compute Unified Device Architecture Best Practices Guide 3.2 (2010)
20. Shirahata, K., Sato, H., Matsuoka, S.: Hybrid map task scheduling on GPU-based
heterogeneous clusters. In: Proceedings of CloudCom-10, The 2nd International
Conference on Cloud Computing, Indianapolis, IN, USA (2010)
21. Aerts, D., Aerts, S., Broekaert, J., Gabora, L.: The violation of bell inequalities
in the macroworld. Foundations of Physics 30(9) (2000) 1387–1414
22. Widdows, D., Peters, S.: Word vectors and quantum logic: Experiments with
negation and disjunction. In: Proceedings of MoL-03, 8th Mathematics of Language
Conference. Volume 8., Bloomington, IN, USA (2003) 141–54
23. Aerts, D., Czachor, M.: Quantum aspects of semantic analysis and symbolic ar-
tificial intelligence. Journal of Physics A: Mathematical and General 37 (2004)
L123–L132
24. van Rijsbergen, C.J.: The Geometry of Information Retrieval. Cambridge Univer-
sity Press, New York, NY, USA (2004)
25. Widdows, D.: Geometry and meaning. (2004)
26. Bruza, P., Widdows, D., Woods, J.: A quantum logic of down below. In Engesser,
K., Gabbay, D., Lehmann, D., eds.: Handbook of Quantum Logic and Quantum
Structures. Volume 2. Elsevier (2009)
27. Kitto, K.: Why quantum theory? In: Proceedings of QI-08, 2nd International
Symposium on Quantum Interaction, Oxford, UK, Springer-Verlag (2008) 11–18
28. Lyons, J.: Semantics. Cambridge University Press, New York, NY, USA (1977)
29. Jones, M., Mewhort, D.: Representing word meaning and order information in a
composite holographic lexicon. Psychological review 114(1) (2007) 1–37
30. Bruza, P., Woods, J.: Quantum collapse in semantic space: interpreting natural
language argumentation. In: Proceedings of QI-08, 2nd International Symposium
on Quantum Interaction, Oxford, UK, College Publications (2008)
31. Widdows, D.: Semantic vector products: Some initial investigations. In: Proceed-
ings of QI-08, 2nd International Symposium on Quantum Interaction, Oxford, UK,
College Publications (2008)
32. Kanerva, P., Kristofersson, J., Holst, A.: Random indexing of text samples for
latent semantic analysis. In: Proceedings of CogSci-00, 22nd Annual Conference
of the Cognitive Science Society. Volume 1036., Philadelphia, PA, USA (2000)
33. Sahlgren, M.: An introduction to random indexing. In: Proceedings of TKE-05,
Methods and Applications of Semantic Indexing Workshop at the 7th International
Conference on Terminology and Knowledge Engineering, Copenhagen, Denmark,
Citeseer (2005)
34. Sahlgren, M., Holst, A., Kanerva, P.: Permutations as a means to encode order in
word space. In: Proceedings of CogSci-08, 30th Annual Meeting of the Cognitive
Science Society, Washington, DC, USA (2008)
35. De Vine, L., Bruza, P.: Semantic oscillations: Encoding context and structure in
complex valued holographic vectors. In: Proceedings of QI-10, 4th Symposium on
Quantum Informatics for Cognitive, Social, and Semantic Processes, Arlington,
VA, USA (2010) 11–13
36. Mitchell, J., Lapata, M.: Vector-based models of semantic composition. In: Pro-
ceedings of ACL-08, 46th Annual Meeting of the Association for Computational
Linguistics, Columbus, Ohio, ACL, Morristown, NJ, USA (2008) 236–244
37. Song, D., Lalmas, M., van Rijsbergen, C., Frommholz, I., Piwowarski, B., Wang,
J., Zhang, P., Zuccon, G., Bruza, P., Arafat, S., et al.: How quantum theory is
developing the field of Information Retrieval. In: Proceedings of QI-10, 4th Sym-
posium on Quantum Informatics for Cognitive, Social, and Semantic Processes,
Arlington, VA, USA (2010) 105–108
38. Humphreys, M., Bain, J., Pike, R.: Different ways to cue a coherent memory sys-
tem: A theory for episodic, semantic, and procedural tasks. Psychological Review
96(2) (1989) 208–233
39. Wiles, J., Halford, G., Stewart, J., Humphreys, M., Bain, J., Wilson, W.: Tensor
models: A creative basis for memory retrieval and analogical mapping. In Dartnall,
T., ed.: Artificial intelligence and creativity. Kluwer Academic (1994) 145–159
40. Plate, T.: Holographic reduced representations: Convolution algebra for composi-
tional distributed representations. In: Proceedings of IJCAI-91, 12th International
Joint Conference on Artificial Intelligence, Syndey, Australia, Citeseer (1991) 30–
35
41. Plate, T.: Holographic reduced representations. IEEE Transactions on Neural
Networks 6(3) (1995) 623–641
42. Antonellis, I., Gallopoulos, E.: Exploring term-document matrices from matrix
models in text mining. In: Proceedings of SDM-06, Text Mining Workshop in
conjuction with the 6th SIAM International Conference on Data Mining, Bethesda,
MD, USA (2006)
43. Rudolph, S., Giesbrecht, E.: Compositional matrix-space models of language. In:
Proceedings of ACL-10, 48th Annual Meeting of the Association for Computational
Linguistics, Uppsala, Sweden, Association for Computational Linguistics (2010)
907–916
44. Rölleke, T., Tsikrika, T., Kazai, G.: A general matrix framework for modelling
information retrieval. Information Processing & Management 42(1) (2006) 4–30
45. Swen, B.: A sense matrix model for information retrieval. Technical report, Tech-
nical report TR-2004-2 of ICL-PK (2004)
46. Novakovitch, D., Bruza, P., Sitbon, L.: Inducing shades of meaning by matrix
methods: a first step towards thematic analysis of opinion. In: Proceedings of
SEMAPRO-09, 3rd International Conference on Advances in Semantic Processing,
Sliema, Malta, IEEE (2009) 86–91
47. Jones, M., Kintsch, W., Mewhort, D.: High-dimensional semantic space accounts
of priming. Journal of memory and language 55(4) (2006) 534–552
48. Zuccon, G., Azzopardi, L., Rijsbergen, C.: Semantic spaces: Measuring the distance
between different subspaces. In: Proceedings of QI-09, 3rd International Sympo-
sium on Quantum Interaction, Saarbruecken, Germany, Springer-Verlag (2009)
225–236
49. Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by
latent semantic analysis. Journal of the American Society for Information Science
41(6) (1990) 391–407
50. Wittgenstein, L.: Philosophical Investigations. Blackwell Publishing, Oxford, UK
(1967)
51. Harris, Z.: Distributional structure. In Harris, Z., ed.: Papers in structural and
transformational Linguistics. Formal Linguistics. Humanities Press, New York,
NY, USA (1970) 775–794
52. Peirce, C.: Logic as semiotic: The theory of signs. In Peirce, C., Buchler, J., eds.:
Philosophical Writings of Peirce. Dover Publications (1955) 98–119
53. Frege, G.: Sense and reference. The philosophical review 57(3) (1948) 209–230
54. Lund, K., Burgess, C.: Producing high-dimensional semantic spaces from lexical
co-occurrence. Behavior Research Methods Instruments and Computers 28 (1996)
203–208
55. Govindaraju, N., Lloyd, B., Dotsenko, Y., Smith, B., Manferdelli, J.: High per-
formance discrete Fourier transforms on graphics processors. In: Proceedings of
SC-08, 21st Conference on Supercomputing, Austin, TX, USA, IEEE (2008)
56. Ufimtsev, I., Martı́nez, T.: Graphical processing units for quantum chemistry.
Computing in Science & Engineering 10(6) (2008) 26–34
57. Watson, M., Olivares-Amaya, R., Edgar, R., Aspuru-Guzik, A.: Accelerating corre-
lated quantum chemistry calculations using graphical processing units. Computing
in Science & Engineering 12(4) (2010) 40–51
58. Stone, J., Saam, J., Hardy, D., Vandivort, K., Hwu, W., Schulten, K.: High per-
formance computation and interactive display of molecular orbitals on GPUs and
multi-core CPUs. In: Proceedings of GPGPU-09, 2nd Workshop on General Pur-
pose Processing on Graphics Processing Units, Washington, DC, USA, ACM (2009)
9–18
59. Catanzaro, B., Sundaram, N., Keutzer, K.: Fast support vector machine training
and classification on graphics processors. In McCallum, A., Roweis, S., eds.: Pro-
ceedings of ICML-08, 25th Annual International Conference on Machine Learning,
Helsinki, Finland, Omnipress (2008) 104–111