Digital Fourier Optics: Haldun M. Ozaktas and David A. B. Miller
Digital Fourier Optics: Haldun M. Ozaktas and David A. B. Miller
Analog Fourier optical processing systems can perform important classes of signal processing operations in parallel, but suffer from limited accuracy. Digitaloptical equivalents of such systems could be built that share many features of the analog systems while allowing greater accuracy. We show that the digital equivalent of any system consisting of an arbitrary number of lenses, lters, spatial light modulators, and sections of free space can be constructed. There are many possible applications for such systems as well as many alternative technologies for constructing them; this paper stresses the potential of free-space interconnected active-device-plane-based optoelectronic architectures as a digital signal processing environment. Implementation of the active-device planes through hybridization of optoelectronic components with silicon electronics should allow the realization of systems whose performance exceeds that of purely electronic systems. r 1996 Optical Society of America
1.
Introduction
The term Fourier optics is used in several different ways. Sometimes it is used to refer to the theory of centered optical systems based on the Fresnel approximation and the assumption of thin lenses and transmissive elements. The ease with which the Fourier transform and various signal processing operations can be realized with these systems has led to the terms also being used almost synonymously with optical information processing or optical signal processing.1,2 These systems can exhibit spacebandwidth products that exceed 106 without excessive difficulty, a capacity that can be almost fully utilized in realizing space-invariant 1convolution-type2 operations and less efficiently for spacevariant operations.35 These operations are realized in the time it takes light to propagate through the system, so that the overall system speed is often limited only by the intrinsic delay Td of the input output devices. What is most noteworthy about these systems is the way in which a large number of parallel channels undergo several consecutive operations in pipeline fashion, with opportunity for global percolation of information at every stage of the pipeline. Despite these advantages, such analog
H. M. Ozaktas is with the Department of Electrical Engineering, Bilkent University, 06533 Bilkent, Ankara, Turkey. D. A. B. Miller is with AT&T Bell Laboratories, 101 Crawfords Corner Road, Holmdel, New Jersey 07733. Received 3 July 1995; revised manuscript received 27 October 1995. 0003-6935@96@081212-08$06.00@0 r 1996 Optical Society of America 1212 APPLIED OPTICS @ Vol. 35, No. 8 @ 10 March 1996
optical information processing systems suffer from certain serious disadvantages, common to most analog systems. In particular, the dynamic range in amplitude seems to be limited to ,2 orders of magnitude, disallowing signal processing operations that require greater accuracy or the concatenation of many stages to be performed. Recently there has been signicant progress in making so-called smart pixels: two-dimensional 12-D2 arrays of electronic processing units, each with optical inputs and outputs.6 Such smart pixels have the potential to capitalize on the advanced state of certain electronics technologies together with the advantages of optics as an interconnection medium.7 One signicant challenge with such electronic arrays is to devise concepts and architectures that provide a platform for realizing this potential. In this paper we show that the basic architectural concepts of analog Fourier optics allow the specication of just such a platform, which enables the construction of a broad range of interesting digital signal processing 1DSP2 systems. It is important to emphasize that we are not proposing merely to insert smart-pixel arrays in existing Fourier optical systems. Rather, we show that the digital optical equivalent of any analog Fourier optical system consisting of lenses, sections of free space, passive or active transmissive elements, etc., can be constructed with properly chosen 1or programmed2 smartpixel arrays interconnected in succession with xed and regular connection patterns. Figure 1 schematically depicts the general class of systems being considered. The construction of such systems has already been demonstrated and investigated for
applications such as telecommunications switching fabrics8 and image processing.9 In an analogoptical processing system, each number is represented by the amplitude or the intensity of light at a certain point at a given time. In the digitaloptical systems discussed here, each complex number is represented by a binary string. Physically, this binary string will usually be represented bit serially by the onoff modulated intensity of a light beam 1or possibly differential pairs of light beams2. 1Of course, we could also choose to use a group of parallel light beams to represent the binary digits in parallel, but this is not likely because, given the present technological constraints of smart-pixel technologies, we are more likely to run at high bit rates with a moderate number of beams that at low bit rates with a large number of beams.2 In analog systems, it is possible to perform many interesting operations by using simple lenses and transparencies. Digital systems require more sophisticated optics for mapping 2-D arrays of distinct light beams according to certain discrete regular connection patterns 1such as the Banyan or perfect shuffle2 between discrete optical inputoutput pixels on consecutive processing planes. The benets of digitaloptical systems with respect to analogoptical systems are unlimited accuracy and cascadability and much greater exibility in implementing different kinds of functions and algorithms, including those that involve nonlinear operations and operations that require memory. The major benet with respect to purely electronic systems is the ability to implement high-speed, high-density global interconnections in three dimensions with less overall power dissipation.
2. Digital Equivalents of Analog Components and Operations
We now discuss digital equivalents of common analogoptical operations and components. We rst describe the equivalent of a dynamically programmable thin transmissive element, better known as a spatial light modulator 1SLM2. Such an element can be realized by using many of the several
so-called smart-pixel technologies; we assume use of the hybrid self-electro-optic-effect-device 1SEED2 technology that we nd particularly promising.10 In this technology, quantum-well diodes, which are capable of acting as high-speed optical detectors 1for input2 and optical modulators 1for output2, are bonded to silicon integrated circuits. For the purpose of this paper, it is sufficient to note that this is essentially a technology that allows optical input and output from a mainstream silicon VLSI circuit. Current technology requires optical inputs or outputs to be spaced by approximately tens of micrometers or more apart. If these optical inputs and outputs are arranged in the form of a regular array of pixels, we essentially obtain a pixelated SLM with plenty of silicon circuit area per pixel. In operation, optical bit streams representing quantized numbers are detected and read in by each pixel. The circuitry corresponding to each pixel modies this number according to purpose. Then the modied number is reconverted into an optical bit stream. Note that nonlinear operations as well as operations with memory are possible. The exibility and the programmability of these devices far exceed those of their analog counterparts. Hence, in addition to allowing improved accuracy, the digital approach will allow certain operations that are inherently difficult or impossible with analog systems, such as buffering, consecutive frame differencing, etc. A thin spherical lens is simply a special case of a thin transmissive element. Thus its digital equivalent may be realized by the use of a hardwired version of the SLMs discussed in the paragraph above. 1If desired, a programmable SLM would allow us to realize a lens with adjustable focal length, pupil size, etc.2 The complex numbers representing the amplitude of light at each pixel will simply be multiplied by the appropriate phase factors characteristic of a lens. 1In contrast with a real lens, this one will have no aberrations, but it will of course exhibit quantization error.2 There is no direct analog of free-space propagation, although we see below that it is possible to simulate free-space propagation. In any case, it is important to note that lenses and free-space propagation are not ends but means. They are the basic building blocks of analog Fourier optics systems, but need not necessarily be the basic building blocks of digital Fourier optics systems. It is more useful to look at the classes of operations performed by analog optical systems at a higher level and to understand how to perform these operations with digitaloptical systems, rather than to try to imitate analog systems at the lowest level. One of the most important operations in signal processing is the Fourier transform. The digital N 3 N point 2-D discrete Fourier transform 1DFT2 can be realized by the use of an ,log2 N-stage 2-D Banyan network.11 1Figure 2 shows a one-dimensional 11-D2 Banyan, which is easier to comprehend. The 2-D DFT can also be realized in two steps
10 March 1996 @ Vol. 35, No. 8 @ APPLIED OPTICS 1213
Fig. 2. 1-D Banyan 1buttery2 network for performing the DFT 1N 5 82. The 2-D Banyan is difficult to draw but is analogous.
involving only 1-D transforms, performed in parallel.2 Given the basic similarity of this type of network with some of the switching fabrics demonstrated in Ref. 8 and the emerging yields and processing power of hybrid SEEDs, we infer that a 16 3 16 DFT could likely be realized with technology being demonstrated now 1as we discuss below2. Of course, by combining two Fourier-transform stages with a SLM in between, we obtain the ability to realize convolutions and correlations by frequencydomain ltering 1these are also among the most important operations in signal processing2. The exibility of the digital approach will also allow most other unitary image transforms12 to be realized as well, with similar log2 N-stage architectures. Because such transforms are used in applications such as transform-domain image compression, it should be possible to contemplate digitaloptical architectures for this and similar purposes. The threedimensional data ow possible with optics should allow one to surpass the performance of fully electronic approaches. Analogoptical approaches would, in most cases, not be able to offer sufficient dynamic range for this type of application.
3. Performance Comparison with Electronic Systems
We can give some estimates of the capabilities of such digitaloptoelectronic processing systems and compare them with purely electronic systems. Let us consider the task of computing an N 5 16 3 16 point 2-D DFT. 1This is comparable with the task of computing an N 5 256 point 1-D DFT.2 The optoelectronic system will consist of log2 N silicon device planes with optoelectronic inputs outputs 1smart-pixel arrays2 of the kind described in Ref. 10, interconnected according to the buttery 1Banyan2 pattern 1or equivalent2. There are many ways of designing such a system. Here we consider one example in its broad outlines, omitting most details. Let us rst concentrate on one of the silicon device planes. A complex multiplier is the dominant part of a radix-2 buttery element used in Fouriertransform processors. For example, the use of a standard serialparallel complex multiplication ar1214 APPLIED OPTICS @ Vol. 35, No. 8 @ 10 March 1996
chitecture with a transistor count, as enumerated by Smith and Denyer,13 would require 9208 transistors in a complementary metal-oxide semiconductor 1CMOS2 for multiplying two complex numbers with 16-bit real and imaginary parts. A 16 3 8 array of such multipliers 1corresponding to a 16 3 16 array of complex numbers2 would therefore require approximately 1.2 million transistors, a number that is within the capabilities of current silicon technology. After the necessary complex multiplications are performed at a given plane, the array of 16 3 16 complex numbers will be optically mapped to the next plane according to the buttery pattern, which involves a fan-in@fan-out factor of 2. Thus for each of the 16 3 16 complex numbers, we need two real and two imaginary inputs and two real and two imaginary outputs, adding up to a total of eight optical inputs outputs per complex number. 1A separate input for the multiplying weight is not necessary, as these weights are xed.2 Allowing for differential operation, in which each real or imaginary part is represented by two light beams, we would therefore require 16 3 16 3 8 3 2 5 4096 optical inputoutput devices. The realization of this many optoelectronic devices appears to be within the capability of the hybrid SEED technology currently being developed.10 Such multiplier circuits can be run in a pipelined mode 1see, for instance, Ref. 142, being able to accept a new input number every n bits 1where n is the number of bits in the real and the imaginary parts2. Hence, for 16-bit numbers, a new number can be fed in every 16 clock cycles. Therefore, assuming a 100-MHz clock rate, a completely new 16 3 16 array of complex numbers could be fed in to be Fourier transformed every 160 ns. The entire optoelectronic Fourier-transforming system would require log2116 3 162 5 8 stages of such silicon chips optically interconnected according to the buttery 1Banyan2 1or other equivalent2 interconnection network. The latency in each individual multiplier is less than 2n cycles14 1when the number of bits in the weight is less than n2, and hence the total latency contributed by the multipliers would be less than 8 3 2 3 16 cycles 5 256 cycles 5 2.56 s. There would be some additional latency contributed by the digital adders and by the optical inputoutput circuits, although it should be relatively small. To sum up, a 16 3 16 5 256 point DFT can be computed in 2.56 s, with a new set of data being pipelined every 160 ns. The total amount of information entering or exiting the system per unit time is 16 3 16 3 2 3 16@160 ns 5 51.2 Gbits@s 5 6.4 Gbytes@s. Because a complex multiplication corresponds to four real multiplications, each chip performs 4 3 16 3 8 5 512 xedpoint multiply-and-add operations every 160 ns, corresponding to 3.2 billion such 16-bit xed-point operations per second. The entire eight-stage system would perform 25.6 billion xed-point operations per second when the pipeline is full. For comparison, a current state-of-the-art elec-
tronic DSP chip 1SHARP LH91242, optimized for fast Fourier transforms, can perform a benchmark 1024point complex Fourier transform in ,80 s in a single DSP chip conguration and in 26 s in a three DSP chip board. For a 256-point transform 1equivalent to the 16 3 16 transform discussed above2, we can scale these times down by the factor 1256 log2 2562@11024 log2 10242 5 0.2 to get a time of 5.2 s for the three DSP chip board. This gure is of the same order as the 2.56-s latency estimate for the optoelectronic system. However, this electronic system is not ready to receive a new input until the end of the 5.2 s. Thus, in situations in which many consecutive sets of data are ready waiting to be Fourier transformed, a new set can be transformed every 160 ns with the optoelectronic system, but only every 5.2 s with the electronic system. The total amount of time needed to transform M sets of data would be M 3 5.2 s with the electronic system and M 3 160 ns 1 2.56 s with the optoelectronic system. For larger values of M, this represents more than a 30-fold increase in speed. The optoelectronic system exhibits greater advantages as we consider larger-size DFTs. For instance, we could conceive of a system that computes 32 3 32 5 1024 2-D DFTs, realized by tiling four of the previously discussed chips side by side in each stage and extending the number of stages from eight to 10. 1In principle there is no limit on the number of chips we can tile side by side to increase the size of the DFTs we can handle, because there need be no electrical data connections between any of the chips. In the electronic case, it would be hard to improve performance proportionately by increasing the number of DSP chips used, because of the difficulty of efficiently realizing electrical interconnections between the chips.2 This system can compute a 1024point DFT in 3.2 s, with a new set of data being pipelined every 160 ns. In comparison, the electronic three DSP chip board mentioned above could perform a 1024-point DFT in 26 s. We see that for 1024-point DFTs, the optoelectronic system offers an advantage in terms of latency by approximately an order of magnitude, in addition to offering more than a 160-fold advantage in terms of pipelining rate. The fully electronic system cannot be pipelined because it cannot receive a new input set before it has nished the entire DFT calculation. This is because the same radix unit in the chip is time multiplexed; it is used, with different coefficients, to perform every radix operation in the entire owgraph 1Fig. 22. The circuits are essentially working at full capacity all the time, leaving no room for pipelining. This way of using the electronic chip1s2 is convenient because it reduces the amount of inputoutput to a level that can be handled by the electronic chips and interconnection technology, while still making good use of the silicon. Attempting to pipeline or parallelize the electronics to emulate the capabilities of the proposed optical architecture will
simply resurrect the problem of providing sufficient aggregate interconnection bandwidth. If one wishes to perform a 16 3 16 complex Fourier transform every 160 ns, then one needs to circulate information at a rate of at least 51.2 Gbits@s through the system, regardless of whether or how the system is pipelined or parallelized. Pipelining or parallelization can help in spreading the problem to be solved over more silicon if necessary, but cannot reduce the required aggregate data bandwidth. Thus, for instance, we may reduce the total time of computation of M sets of 1024 DFTs down to 26 s by introducing M identical sets of electronic hardware, but it is unlikely that we can further signicantly reduce the time of computation needed for a single transform through parallelism and pipelining. The ultimate pipelined and parallelized electronic implementation of the fast Fourier-transform algorithm is the direct realization of the full buttery graph on a single large wafer. Even with this approach, it is possible to show that the advantage of the optoelectronic system in terms of time of computation and pipelining rate becomes more and more signicant as we consider DFTs of larger and larger sizes, a result that is not surprising when one considers the planar nature of VLSI circuits and the inferior scaling behavior of electrical lines that is due to resistance.15,16 Of course, for a completely fair comparison with the purely electronic system mentioned, the optical system described above has yet to be made. This would require signicant engineering and could be prohibitively expensive for certain practical applications. On the other hand, over time the optical components will become progressively better engineered and less expensive. Furthermore, as silicon integrated circuits become more capable, so also do the optoelectronic hybrid processors we have described. In many previous optical computing systems, the demonstrated or projected performance gures were never able to keep up with those of constantly improving silicon circuitry. Here, such improvements help the optoelectronic systems just as much. The comparisons we have just made for Fourier transforming systems more or less carry over to convolution and correlation systems. This is because the implementation of dynamic complex spatial lters also requires arrays of complex multipliers, for which similar considerations, discussions, and calculations as given above apply. 1Strictly, twice as many multipliers would be needed as for the Fourier-transform stages, which have only one multiplier for every two complex numbers. Thus approximately twice as many transistors would be required in each plane.2
4. Digital Equivalents of Arbitrary Analog Systems
Other elementary operations that are frequently used in analog Fourier optics are broadcast and integration. The equivalents of these operations can also be realized in log2 N stages, as shown in
10 March 1996 @ Vol. 35, No. 8 @ APPLIED OPTICS 1215
real constants independent of x and x8. Imaging, Fourier transforming, free-space propagation, and passage through a lens are all special cases of the above. Now, it is known that 1perhaps with appropriate scaling of the transverse coordinates2 any such system can be decomposed in any of the following forms: qp1a, b, g2 5 lns1 f12 ft lns1 f22, or qp1a, b, g2 5 lns1 f32 frft1a2 lns1 f42,
Fig. 3. 1-D tree network for broadcast. analogous. The 2-D version is
122
132
Figs. 3 and 4. Note that in realizing the equivalent of integration, addition is not performed by superposition of light beams, but by the use of digital adders. Likewise, the equivalent of broadcast is realized in a tree manner for the sake of maintaining power levels. 1If sufficient power exists or if low signal-tonoise ratios can be tolerated, broadcast can actually be done in one stage by the optical replication of a single beam. Integration of moderate numbers of beams could also be done in one stage by the digital addition of the inputs from several optical detectors.2 These operations are useful, for instance, in realizing matrixvector product architectures.2 Now we show that the digitaloptical equivalents of a general class of analog optical systems can be constructed. A rather general class of analog optical systems is the class of so-called quadraticphase systems.17,18 Optical systems involving an arbitrary sequence of thin lenses and sections of free space belong to this class. They are characterized by the inputoutput relation 1for simplicity in one dimension2: fout1x2 5
2`
or still other forms.1820 qp denotes a quadraticphase system, lns a lens, fs free-space propagation, ft a Fourier transform, and frft a fractional-order Fourier transform,18,2125 with their parameters, focal lengths 1 f 2, distance 1d2, order 1a2, etc., given in parentheses. Because we already have the equivalents of lenses and a Fourier transformer, Eq. 122 will allow the realization of any quadratic-phase system 1including imaging systems, free-space propagation, etc.2. Alternatively, instead of taking the Fourier transform as the basic building block, we can use freespace propagation or the fractional Fourier transform as the basic building block. Digital equivalents of free-space propagation and the fractional Fourier transform can both be realized by the use of ,log N-stage networks, which are not unlike that shown in Fig. 2. Then Eq. 132 or Eq. 142 allows the realization of any quadratic-phase system. The essential point is that quadratic-phase systems can be decomposed into local operations that can be performed by the digital SLMs and global operations that can be efficiently realized with free-space optics. 1The latter is possible because the connection patterns required for the global operations are sufficiently regular.352 Finally, we note that systems more general than quadratic-phase systems 3given by Eq. 1124 can be realized if any number of SLMs 1programmable lters2 are sandwiched between any number of quadratic-phase systems. Because we know how to build digital equivalents of both the lters and the quadratic-phase systems, the equivalent of any analog Fourier optical system composed of lenses, lters, SLMs, and sections of free space 1including anamorphic systems2 can be realized digitally. It is perhaps also worth noting that the ease with which complex spatial ltering is performed makes holography a redundant concept for digital systems. These digital systems can perform any of the mathematical operations performed by real-time holograms in, for example, the Fourier plane. Complex numbers are equally well represented, stored, and operated on in the form of binary streams. 1However, this does not exclude the use of physical holographic elements in
realizing the physical optical connections of digital Fourier optics systems, such as the connections shown in Fig. 2.2 In some cases, it may be convenient to implement certain suboperations within a digital Fourier optics system by the use of true analog elements, instead of by replacing them also with their digital equivalents. For instance, it might be more simple to implement magnication by a simple optical imaging conguration. Because a one-to-one mapping is in question here, the digital nature of the signals are preserved, and the necessary dynamic range might be attained.
5. Discussion and Conclusion
The digitaloptical systems being considered also have certain disadvantages with respect to their analog counterparts. Apart from the fact that the cost and the complexity of the optical system have greatly increased, we now have to wait at least of the order of log N times the intrinsic device delay Td to perform a DFT, whereas in the analog case the delay was of the order of 1 3 Td. The bit rate of the signals passing through the systems is now limited by the speed of the optical devices, so that the system is no longer bandwidth transparent, but this is usually the case in conventional analog systems anyway, because of the inputoutput devices. 1An exception may be photonic-switching systems in which the input and the output are optical.2 Current technology suggests that these digitaloptical systems may be clocked at rates of 100 MHz or more. 1In fact, because the limitations associated with electronic inputoutput from the chip are eliminated, it may be well possible to run optically interconnected chips at clock rates higher than those of electrically interconnected chips. For example, Lentine et al.26 recently reported running 0.9-m silicon electronic circuits with optical inputoutput at 700 [email protected] Combined with unlimited accuracy and cascadability and the ability to utilize substantial logical complexity in each optical pixel 1e.g., thousands of gates per pixel2, the loss of optical transparency is not considered a major disadvantage. A more signicant disadvantage is that the number of pixels one can currently handle with digital systems is small compared with that of conventional analog systems. One reason for this is that yield considerations limit the size of the SEED arrays that can be built, a limitation that might be alleviated with improving technology. Based on previous experience with SEED technology, it appears that thousands of optical pixels will be feasible, although this remains to be demonstrated. Another limitation is that greater tolerances must be satised by the optics that maps the spots of light from device plane to device plane, limiting attainable spacebandwidth products to values smaller than are common with analog systems. Multistage systems with thousands of beams per stage have, however, been demonstrated.8
For some applications, the smaller array of pixels that these systems can handle 1compared with analog systems2 may be sufficient. As one example, let us consider the highly successful image compression method known as transform-domain coding.12 In this method the image is broken down into blocks 1subimages2, and each block is transformed independently. A typical size for these blocks is 8 3 8 pixels, amounting to 642 5 4096 blocks for a 512 3 512 pixel image. In general, choosing larger block sizes enables one to improve the amount of compression obtained. However, studies have shown that increasing the block size beyond 16 3 16 or 32 3 32 does not offer any further advantage.12 Thus the optimal size of the blocks well matches the capabilities of the digitaloptical systems discussed. Assuming 162 5 256 blocks of 32 3 32 pixels each, the digitaloptical system discussed above will require 256 3 160 ns 1 3.2 s 5 44.2 s to compute the transform of all blocks, whereas the electronic three DSP chip board will require 256 3 26 s 5 6.7 ms. For applications for which smaller arrays of pixels are not sufficient 1such as processing a 512 3 512 image as a whole2, one must explore means of overcoming this limitation. One approach is as follows. Because with developing submicrometer technology we can expect to have enough silicon circuitry to handle a far greater number of data pixels 1e.g., those of a sampled image2 than the available optical inputoutput pixels, we might consider the strategy of multiplexing the information belonging to several data pixels through a single optical connection. However, this will cause problems if the information emanating from different data pixels wants to go to different places, an issue that must be explicitly considered. An alternative approach is to break the limit imposed by single arrays of SEED devices and single-lens imaging by the devising of ways of tiling these in the transverse dimension to realize effectively larger arrays 1as discussed in our numerical example above2. In general, analog systems are useful in applications that require the capacity to deal with a large number of pixels, but in which low-amplitude accuracy and limited exibility can be tolerated. Digital systems, on the other hand, can provide greater exibility and amplitude accuracy, but can deal with a somewhat smaller number of pixels 1although this may change if the above strategies are successful2. Digital systems also introduce a larger number of device delays compared with analog systems, but this issue should usually be secondary in deciding which approach to pursue for a given application. Throughput 1information processed per second2 is likely to be more important than latency 1processing delay2 in many signal processing applications, as long as the delay is within reasonable bounds. Another critical issue is the comparison of digital Fourier optical systems with fully electronic digital systems. Technologies for vertically integrating large numbers of silicon chips are now available.27
10 March 1996 @ Vol. 35, No. 8 @ APPLIED OPTICS 1217
The density of vias in the vertical direction is much lower than the density of connections within the chip, but is comparable with the density of optical pinouts possible with the optoelectronic technology mentioned in this paper. Does this mean that one can always build the equivalent of a digital Fourier optical system by using such fully electronic vertical integration technologies? The answer is no. The vias in the mentioned electronic technologies provide direct connections only to the closest region of the next chip in the stack, whereas the optical system under consideration can provide global connections. The general principles of VLSI complexity theory and studies such as Refs. 15 and 16 clearly indicate that the ability to provide global connections will result in more efficient implementation of structures that require global information ow, such as the DFT, at least beyond a certain value of N. 1This does not exclude the use of vertical integration technologies to increase the amount of silicon available per pixel, if needed, in digital Fourier optical systems.2 For a quantitative discussion and for consideration of applications that require intermediate levels of globality, we refer the reader to the mentioned references. Over the past 30 years, the elegance and the promise of analog Fourier optics have resulted in a large number of architectures and systems being proposed and studied.2 Some of these concentrated on image processing 1including pattern recognition2, some on algebraic processing 1including matrix operations2, and others on parallel processing 1including systolic and cellular systems2. Although some of the suggested systems became successful because the application was one whose requirements were well matched to the capabilities of an analog system, others failed because they required the accuracy or the exibility of digital systems, which analog systems could not provide. We believe that reconsideration of digital Fourier optical versions of these systems and architectures will allow the potential of these concepts, architectures, and algorithms to be realized. The primary research task in this area is the creation and the development of a exible platform that allows the wide class of systems alluded to in this paper to be realized by the use of standard components and procedures. We are pleased to acknowledge stimulating discussions with Joe Ford, Fouad Kiamilev, Ashok Krishnamoorthy, and Levent Onural.
References
1. J. W. Goodman, Introduction to Fourier Optics 1McGraw-Hill, New York, 19682. 2. J. W. Goodman, Four decades of optical information processing, Opt. Photonics News 2122, 1115 119912. 3. H. M. Ozaktas, Y. Amitai, and J. W. Goodman, Comparison of system size for some optical interconnection architectures and the folded multi-facet architecture, Opt. Commun. 82, 225228 119912. 4. H. M. Ozaktas and D. Mendlovic, Multi-stage optical intercon1218 APPLIED OPTICS @ Vol. 35, No. 8 @ 10 March 1996
5.
6.
7.
8.
9.
10.
11.
12. 13.
21.
22.
23.
nection architectures with least possible growth of system size, Opt. Lett. 18, 296298 119932. D. Mendlovic and H. M. Ozaktas, Optical-coordinate transformation methods and optical-interconnection architectures, Appl. Opt. 32, 51195124 119932. See, for example, the special issue on smart pixels, S. R. Forrest and H. S. Hinton, eds., IEEE J. Quantum Electron. 29122, 119932. See, for example, D. A. B. Miller, Computing with light, in 1995 Yearbook of Science and the Future 1Encyclopedia Britannica, Chicago, 19942, pp. 134147. F. B. McCormick, T. J. Cloonan, A. L. Lentine, J. M. Sasian, R. L. Morrison, M. G. Beckman, S. L. Walker, M. J. Wojcik, S. J. Hinterlong, R. J. Crisci, R. A. Novotny, and H. S. Hinton, Five-stage free-space optical switching network with eldeffect transistor self-electro-optic-effect-device smart-pixel arrays, Appl. Opt. 33, 16011618 119942; H. S. Hinton, T. J. Cloonan, F. B. McCormick, A. L. Lentine, and F. A. P. Tooley, Free-space digital optical systems, Proc. IEEE 82, 1632 1649 119942. K. S. Huang, C. B. Kuznia, B. K. Jenkins, and A. A. Sawchuk, Parallel architectures for digital optical cellular image processing, Proc. IEEE 82, 17111723 119942. K. W. Goossen, J. E. Cunningham, and W. Y. Jan, GaAs 850 modulators solder-bonded to silicon, IEEE Photonics Technol. Lett. 5, 776778 119932; K. W. Goosen, J. A. Walker, L. A. DAsaro, S. P. Hui, B. Tseng, R. Leibenguth, D. Kossives, D. D. Bacon, D. Dahringer, L. M. F. Chirovsky, A. L. Lentine, and D. A. B. Miller, GaAs MQW modulators integrated with silicon CMOS, IEEE Photonics Technol. Lett. 7, 360362 119952. F. Kiamilev and A. Krishnamoorthy, Smart pixel designs for image processing, in Photonics for Processors, Neural Networks, and Memories II, J. L. Horner, B. Javidi, and S. T. Kowel, eds., Proc. Soc. Photo-Opt. Instrum. Eng. 2297, 37 119942. A. K. Jain, Fundamentals of Digital Image Processing 1Prentice-Hall, Englewood Cliffs, N.J., 19892, p. 505. S. G. Smith and P. B. Denyer, Efficient bit-serial complex multiplication and sum-of-products computation using distributed arithmetic, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1986 1IEEE, New York, 19862, pp. 22032206. R. F. Lyon, Twos complement pipeline multipliers, IEEE Trans. Commun. 24, 418425 119762. H. M. Ozaktas and J. W. Goodman, Elements of a hybrid interconnection theory, Appl. Opt. 33, 29682987 119942. A. V. Krishnamoorthy, P. J. Marchand, F. E. Kiamilev, and S. C. Esener, Grain-size considerations for optoelectronic multistage interconnection networks, Appl. Opt. 31, 5480 5507 119922. M. J. Bastiaans, Wigner distribution function and its application to rst-order optics, J. Opt. Soc. Am. 69, 17101716 119792. H. M. Ozaktas and D. Mendlovic, Fractional Fourier optics, J. Opt. Soc. Am. A 12, 743751 119952. A. Papoulis, Signal Analysis 1McGraw-Hill, New York, 19772, p. 289. M. Nazarathy and J. Shamir, First-order opticsa canonical operator representation: lossless systems, J. Opt. Soc. Am. 72, 356364 119822. H. M. Ozaktas, B. Barshan, D. Mendlovic, and L. Onural, Convolution, ltering, and multiplexing in fractional Fourier domains and their relation to chirp and wavelet transforms, J. Opt. Soc. Am. A 11, 547559 119942. A. W. Lohmann, Image rotation, Wigner rotation, and the fractional Fourier transform, J. Opt. Soc. Am. A 10, 2181 2186 119932. A. W. Lohmann and B. H. Soffer, Relationships between the
RadonWigner and fractional Fourier transformations, J. Opt. Soc. Am. A 11, 17981801 119942. 24. L. B. Almeida, The fractional Fourier transform and timefrequency representations, IEEE Trans. Signal Process. 42, 30843091 119942. 25. A. C. McBride and F. H. Kerr, On Namiass fractional Fourier transforms, IMA J. Appl. Math. 39, 159175 119872. 26. A. L. Lentine, K. W. Goossen, J. F. Walker, L. M. F. Chirovsky, L. A. DAsaro, B. Tseng, R. E. Leibenguth, D. Kossives, D.
Dahringer, D. D. Bacon, and T. K. Woodward, 700 Mb@s operation of optoelectronic switching nodes comprised of ip-chip-bonded GaAs@AlGaAs MQW modulators on silicon CMOS circuitry, in Conference on Lasers and Electro-Optics, Vol. 15 of 1995 OSA Technical Digest Series 1Optical Society of America, Washington, D.C., 19952, postdeadline paper CPD11. 27. M. J. Little and J. Grinberg, The 3-D computer: an integrated stack of WSI wafers, in Wafer-Scale Integration 1Kluwer, New York, 19882, Chap. 8.
1219