|
|
Inferring eye movements on the basis of head and visual target positionSOLA International Industry contacts:
MISG moderators:
|
||||||||
|
The companySOLA began in 1956, with nine opticaltechnicians experimenting in a garage in Adelaide,South Australia. Their goal was to cast spectacle lenses from recently discovered plastic materials. After a few early successes, those pioneers founded SOLA (or Scientific Optical Laboratories of Australia), as it was first called) in 1960. SOLA's first overseas subsidiary opened in Japan, in 1968, and in ensuing years, operations were added throughout Asia, Europe and the Americas. In 1979, SOLA was acquired by Pilkington plc, and in 1988 the corporate headquarters moved from Australia to Menlo Park, California. The company's global expansion continued during the 1980's, with new manufacturing operations opening in Venezuela, Taiwan and China. In 1993, SOLA was purchased by AEA Investors Inc., and in March 1995 the company was listed on the New York Stock Exchange. Further double-digit sales growth came from new products, international expansion and through the acqusitions of US lensmakers Neolens and American Optical. SOLA's regional structure now includes North America, Europe, Asia, Australia, South America and an international Sunlens Division. Across the world over 100 million people go about their daily lives wearing SOLA lenses. The company now operates major research and development centres in Adelaide, South Australia and Petaluma, California; supported by a specialist process engineering team at a plant in Wexford, Ireland. From SOLA's earliest years, lens technology has continued to evolve. The result is thinner, lighter lens forms, innovative new designs and high performance coatings that combine exceptional cosmetics with optical excellence. A measure of SOLA's success in technical development and manufacturing is the fact that over 60% of sales come from new, value-added products. IntroductionThe lens design process increasingly relies on complex ray-tracing that requires the geometry of the visual task to be simulated in order for the optical implications to be calculated and plotted. In order to improve our models the location on the lens surface at which distance, intermediate and near tasks are performed needs to be understood. Traditional eye trackers are expensive, cumbersome and intrusive, we have developed a non-intrusive, low cost eye-tracking system based on head and stimulus position measures. The ability to estimate eye movements relies on a precise knowledge of the geometry of the reading task, its location, the location of the readers eyes and an ability to correlate the time series data with these positions. The current system as used in this experiment allows us to measure and estimate, where indicated, the following:
MethodsApparatus & procedureSubjects wear a near or intermediate prescription, depending on the task, fitted to a custom lens clip attached to a demonstrator frame. The clip is fitted with a pair of stock single vision lenses that include the near Sphere and Cyl correction. The lens is circular, 38 mm in diameter. Also attached to the frame is a Polhemus electromagnetic motion sensing cube (see Figure 1). The average distance position of the head receiver cube and the apex of the cornea of the Right eye is 30 mm temporal, 30 mm above and 20 mm in front of the corneal apex. As wearers perform a standardised near and intermediate reading task the position of both the head and book are recorded (10 Hz sampling rate) in X,Y,Z, Azimuth (head turn), Elevation (head nod angle) and Roll space (side to side tipping) using a Polhemus electromagnetic motion sensor system.
Near and Intermediate tasksSubjects read a standardised near and intermediate stimulus as described below. During these tasks head and stimulus position are recorded in real-time with a 10 Hz sampling rate and to an accuracy of approximately 5 mm (X,Y & Z) and 1 degree (Azimuth, Elevation & Roll). The near reading task requires the subject to read text broken into three paragraphs in 10 point Times New Roman font (see Figure 4).
The intermediate reading target is printed on an A4 page (210 mm x 297 mm) in 'landscape' mode. Three rows of three digit numbers in 10 point 'Times New Roman' font are printed so that the top row is 15 mm from the top of the page, the second row is 90 mm below the first and the final row 90 mm below that. The columns flanking three centre column are 135 mm to the left and right of centre. The target is clipped onto the transmitter stand so that the middle column of numbers is in line with the centre of the transmitter cube. We are mainly interested in eye-turn in the intermediate data. Eye declination is not required.
DataNear eye declinationNear Eye declination is based on head declination (elevation) in combination with the relative position of the near target. The physical position of the paragraphs is known relative to the book sensor (which is coincident with the middle paragraph). When the subject is calibrated the wearer is assumed to be in the primary gaze position, that is the eyes and head are level with the horizon. The position of the head and book sensor are projected onto a two dimensional plane based on the X and Z coordinates. The angle of the book and position is calculated relative to the angle of the head sensor and its position. Based on a parsing of the data into top middle and bottom paragraphs, the eye declination is calculated using the fact that the top and bottom paragraphs are 120 mm above and below the middle paragraph. This geometry is inclined at the book angle and the relative average eye declination angles to each paragraph are calculated. The software takes the minimum of the three calculated average eye declinations as the basis for the dispensing model. The minimum is selected since the wearer will be reading high in the near zone or low in the intermediate and this will be the most difficult conditions a progressive addition lens will be expected to perform under (progressive lenses have a channel of clear vision with flanking blur, the width of this channel is of considerable design interest). The difference between these two angles reveals the eye declination required to reach the near target, this data is presented in Figure 6.
Near eye turn and corridor widthEye turn relates directly to the demands a wearer will place on corridor width. Assuming the page is being held centrally, the 1/2 line width is known to be 85mm. The reading distance D is obtained from a trigonometric calculation of a two dimensional projection of the head-sensor and the book sensor in X (back-forward) and Z (vertical) coordinates. In other words, the book is assumed to be held centrally and not off to one side. The inverse tan(85/D) is then used to calculate the eye-rotation angle required for the wearer to reach of the line without a head rotation (EOL). The inferred eye rotation when the wearer reaches the end of the line is then calculated to be EOL - (2 x SDAzimuth), this is done for up to three paragraphs and the mean taken as the final eye-turn value. This value is compared to the standard model to determine the position on the distribution and therefore if the wearer was high, medium or low. Our analysis suggests that this estimate of the maximum head turns is OK but we are interested in better approaches.
Although we cannot know precisely where the wearer is looking at a given instant in the head-track trace we can infer the typical eye-rotations required at the start and end of each line. To do this we first calculate the maximum head rotation angle (taken as 2 twice the SD of the head turn data ~ 95%) and subtract this from the angle required to reach the end of the line (EOL). The difference between these two angles is the inferred eye-rotation. There are a number of assumptions in this trigonometric analysis as follows: First, three eye-rotation values are calculated and the mean of these is taken as the eye turn value. The three individual values should correspond to the top, middle and bottom paragraphs. This calculation relies on the data being passed correctly into the sections of the recording run that correspond to head movements made when reading the three paragraphs. We chose to parse the data by paragraph because a number of wearers make significant head turns between paragraphs and this 'non-reading' head turn information should not be included in the head turn calculations. The parsing algorithm uses the head declination data (elevation) with Z (vertical) movements of the book to compute a relative velocity of these two movements (this velocity can be though of as face sweep across the page), high slopes seen in the vicinity of the 1/3 and 2/3 parts of the trace are assumed to be wearers moving to see the next paragraph. This algorithm appears to work very well but can be confounded by variable reading speed or very low head and book movement. We would are interested in better parsing algorithms. Intermediate dataThe intermediate task reading distance and eye-turn are considered below. We should point out that the external validity, the generalisability of the data to real life situations, is more questionable than the near reading task. Of the tasks that a wearer may engage in, the intermediate task is designed to simulate using a computer monitor. An A4 landscape page most closely resembles the geometry of a 15" monitor. The reader wishing to ponder this issue is directed to Figure 5. Unlike the near task, the intermediate data is not parsed when calculating the standard deviation of the head turn/Eye Turn data nor is any data discarded from the beginning and end of the run. We assume that the operator should press start when they hear the wearer start reading numbers and stop when they reach the last number so there is less ambiguity about when the wearer has started and finished reading. Intermediate eye turnsWe believe calculations to determine eye-turn may indicate that intermediate eye turn is less robust than the near eye turn data. We assume twice the standard deviation encompasses 95% of all eye turns and is therefore a good estimate of the maximum eye turn. As can be seen in Figure 8 the data looks more bi-modal than normally distributed. The 95% figure is based on assumptions of the normal distribution. On the other hand 2 x SD = 10.6 degrees which equates to 13.9 degree 95% cumulative point. The actual 95% cumulative point is 12.5 degrees, a 1.4 degree discrepancy, not enough to account for the -4 to -8 degree eye rotations. It is also possible that the dynamic nature of the intermediate task, reading widely spaced numbers, caused some subjects to over-shoot the end of line point. In detail - Eye turn is calculated similar to the near eye turn, that is the maximum head rotation angle (taken as 2X the SD of the head turn data) is subtracted from the head turn angle required for the wearer to point their nose directly at the numbers at the ends of the line (EOL). The difference between these two angles is the inferred intermediate eye-rotation. There are a number of assumptions in this trigonometric analysis as follows: The distance between the head and transmitter cube is taken as the eye to page distance. The geometry of the intermediate stimulus is assumed to have numbers 140 mm either side of the central number on each line and the wearer is assumed to be central to the page and not off to one side. Eye turn values therefore reflect the predicted eye turns required at the beginning and end of each line in order to fixate the first and last number on the line.
Problems for MISG to consider
|