When it Rains, It Pours: A Digital Collections News Round Up

2015 has been a banner year for Duke Digital Collections, and its only January! We have already published a new collection, broken records and expanded our audience. Truth be told, we have been on quite a roll for the last several months, and with the holidays we haven’t had a chance to share every new digital collection with you. Today on Bitstreams, we highlight digital collection news that didn’t quite make the headlines in the past few months.

H. Lee Watersmania

waterschart
Compare normal Digital Collections traffic to our Waters spike on Monday January 19th.

Before touching on news you haven’t about, we must continue the H. Lee Waters PR Blitz. Last week, we launched the H. Lee Waters digital collection. We and the Rubenstein Library knew there was a fair amount of pent-up demand for this collection, however we have been amazed by the reaction of the public. Within a few days of launch, site visits hit what we believe (though cannot say with 100% certainty) to be an all time high of 17,000 visits and 37,000 pageviews on Jan 19.  We even suspect that the intensity of the traffic has contributed to some recent server performance issues (apologies if you have had trouble viewing the films – we and campus IT are working on it).

We have also seen more than 20 new user comments left on Water’s films pages, 6 comments left on the launch blog post, and 40+ new likes on the Duke Digital Collections Facebook page since last week. The Rubenstein Library has also received a surge of inquiries about the collection. These may not be “official” stats, but we have never seen this much direct public reaction to one of our new digital collections, and we could not be more excited about it.

Early Greek Manuscripts

An example from the early Greek Manuscript collection.
An example from the early Greek Manuscript collection.

In November we quietly made 38 early Greek manuscripts available online, one of which is the digital copy of a manuscript since returned to the Greek government.  These beautiful volumes are part of the Rubenstein Library and date from the 9th – 17th centuries.   We are still digitizing volumes from this collection, and hope to publish more in the late Spring.  At that time we will make some changes to the look and feel of the digital collection.  Our goal will be to further expose the general public to the beauty of these volumes while also increasing discoverability to multiple scholarly communities.

 

Link Media Wall Exhibit

In early January, the libraries Digital Exhibits Working Group premiered their West Campus Construction Link media wall exhibit, affectionately nicknamed the Game of Stones.   The exhibit features content from the Construction of Duke University digital collection and the Duke University Archives’ Flickr sets.   The creation of this exhibit has been described previously on Bitstreams (here and here).  Head on down to the link and see it for yourself!campus_constr

 

History of Medicine Artifacts

Medicine bottles and glasses from the HOM artifacts collection.

Curious about bone saws, blood letting or other historic medical instruments? Look no further than the Rubenstein Libraries History of Medicine Artifact’s Collection Guide.   In December we published over 300 images of historic medical artifacts embedded in the collection guide.  Its an incredible and sometimes frightening treasure trove of images.

These are legacy images taken  by the History of Medicine.  While we didn’t shoot these items in the Digital Production Center, the digital collections team still took a hands on approach to normalizing the filenames and overall structure of the image set so we could publish them.  This project was part of our larger efforts to make more media types embeddable in Rubenstein collection guides, a deceptively difficult process that will likely be covered more in depth in a future Bitstreams post.

Digitization to Support the Student Nonviolent Coordinating Committee (SNCC) Legacy Project Partnership

Transcript from an oral history in the Joseph Sinsheimer papers.
Transcript from an oral history in the Joseph Sinsheimer papers.

In the last year, Duke University Libraries has been partnering with the SNCC Legacy Project and the Center for Documentary Studies on One Person One Vote: The Legacy of SNCC and the Fight for Voting Rights.  As part of the project, the digital collections team has digitized several collections related to SNCC and made content available from each collections’ collection guide.  The collections include audio recordings, moving images and still images.  Selections from the digitized content will soon be made available on the One Person One Vote site to be launched in March 2015.  In the meantime, you can visit the collections directly:  Joseph Sinsheimer PapersFaith Holsaert Papers, and SNCC 40th Anniversary Conference.

 

Coach 1K

Coach K’s first Duke win against Stetson.

This one is hot off the digital presses.  Digital Collections partnered with University Archives to publish Coach K’s very first win at Duke just this week in anticipation of victory # 1000.

What’s Next for Duke Digital Collections?

The short answer is, a lot!  We have very ambitious plans for 2015.  We will be developing the next version of our digital collections platform, hiring an intern (thank you University Archives), restarting digitization of the Gedney collection, and of course publishing more of your favorite digital collections.   Stay tuned!

Embeds, Math & Beyond

This week, in conjunction with our H. Lee Waters Film Collection unveiling, we rolled out a handy new Embed feature for digital collections items.  The idea is to make it as easy as possible for someone to share their discoveries from our collections, with proper attribution, on other websites or blogs.

How To

It’s simple, really, and mimics the experience you’re likely to encounter getting embed code from other popular sites with videos, images, and the like. We modeled our approach loosely on the Internet Archive‘s video embed service (e.g., visit this video and click the Share icon, but only if you are unafraid of clowns).

Embed Link

Click the “Embed” link under an item from Duke Digital Collections, and copy the snippet of code that pops up. Paste it in your website, and you’re done!

Examples

I’ll paste a few examples below using different kinds of items. The embed code is short and nearly identical for all of these:

A Single Image

Paginated Item

A Video

Single-Track Audio

Multi-Track Audio

Document with Document Viewer

Technical Considerations

Building this feature required a little bit of math, some trial & error, and a few tricks. The steps were to:

  • Set up a service to return customized item pages at the path http://library.duke.edu/digitalcollections/embed/<itemid>/
  • Use CSS & JS to make the media as fluid as possible to fill whatever space it ends up in
  • Use a fixed height and overflow: auto on the attribution box so longer content will scroll
  • Use link rel=”canonical” to ensure the item’s embed page is associated with the real item page (especially to improve links / ranking signals for search engines).
  • Present the user a copyable HTML <iframe> element in the regular item page that has the correct height & width attributes to accommodate the item(s) to be embedded

This last point is where the math comes in. Take a single image item, for example. With a landscape-orientation image we need to give the user a different <iframe> height to copy than we would for a portrait. It gets even more complicated when we have to account for multiple tracks of audio or video, or combinations of the two.

Coming Soon

We’ll refine this feature a bit in the coming weeks, and work out any embed-bugs we discover. We’ll also be developing a similar feature for embedding digitized content found in our archival collection guides.

“See Yourself in the Movies!”: H. Lee Waters Goes Online

 

hleewaters-200x300_desaturated
H. Lee Waters, circa 1942.

When in the late 1980s Duke Libraries first began collecting H. Lee Waters’ “Movies of Local People,” the only way the films could be seen was through projection or, given the deluxe treatment, played back on a video tape.  Nearly 30 years later the David M. Rubenstein Rare Book & Manuscript Library and Duke Library’s Digital Projects and Production Services team are proud to bring the films online.  It is an important milestone for a collection that has grown so organically through the years and whose individual films have come from so many sources.

The motion picture films in the H. Lee Waters Collection play out a history of North Carolina (and Virginia, and South Carolina) in the late 1930s and early 1940s unparalleled in scope and vision.  But what would eventually become such a grand gift to the citizens and scholars and artists of the region did not begin with that in mind.  Like fellow commercial photographer and North Carolinian Hugh Mangum, Waters might be considered an accidental documentarian, taking to the road in the depths of the Depression as a resourceful businessman, filling theatre seats with audiences who paid to see themselves in the movies.  And yet, a natural behind the camera, Waters knew composition and how to frame a shot; more importantly, he knew people, loved to be around them, and could draw from his subjects positive reactions to this unexpected man with a camera, outside the mill, on main street, in front of the school, in the shop.  As Waters biographer and documentarian Tom Whiteside has noted, Waters’ quick-cut aesthetic managed the immediate goal of getting as many townsfolk into the movie as possible while achieving, in the long-term, an archive of still image frames that is vast in its scope and ripe for investigation.  From this perspective, the vernacular of his art puts him in the company of the prominent documentary photographers of his day.

 

A still from Clayton (N.C.), circa 1936-1937 (Reel 1), one of the films in H. Lee Waters Digital Collection.
A still from “Clayton (N.C.), circa 1936-1937 (Reel 1),” one of the films in H. Lee Waters Digital Collection.

Waters used reversal film, and the film he projected was the same film he shot in the camera, edited for length and his beloved special effects.  He worked quickly, didn’t make copies, and after coming off the road in 1942 shelved the films until, later in life, he started selling them to their respective communities.  Duke’s collection of H. Lee Waters films therefore owes a debt to the towns, libraries, and historical societies who over the years have sent, and continue to send, Waters’ legacy to Duke, recognizing that centralizing these resources works in favor of the region’s cultural heritage.  It also means that over the years Duke has accrued film in all conditions and states of preservation.  There is film in the collection that is literally turning to dust; there is also beautiful Kodachrome that could have been shot yesterday.  Since 1988, too, audiovisual preservation has changed dramatically.  Thankfully, and with the help of the National Film Preservation Foundation, a substantial number of the films have received full film-to-film preservation; nevertheless, earlier, heroic attempts at saving some films to videotape, some formulations of which are now severely degrading, have left us in a few cases with only a blurred shadow of what must have been on that original film.  So our digital project reflects the films and their creator, but also the history of the collection at Duke.

A still image from Kannapolis (N.C.), 1941 (Reel 1), one of the films in the H. Lee Waters Digital Collection.
A still  from “Kannapolis (N.C.), 1941 (Reel 1),” one of the films in the H. Lee Waters Digital Collection.

Many at Duke Libraries have made the Waters collection what it is today, and those of us working on bringing the films online build on the efforts of librarians, archivists, and technical staff who were as passionate about these movies as we are.  Ever in transition, the collection is marked by growth, an element that we see as integral to the website.  In fact we are already adding to it.  In addition to the films and (for some of them) shotlists, there are oral history interviews with the children of H. Lee Waters.  Tom Waters and Mary Waters Spaulding have not only been essential in bringing their father’s films online, they have a unique perspective on a talented man whose contribution to the history of North Carolina was only beginning to be appreciated when he died in 1997.  Waters’ home movies will be added to the site soon, and we anticipate presenting select work inspired by the Waters films, because, in addition to their own sublime artistry, the movies remain a magnet for artists and documentarians mining archival sources.  One such work will debut March 20 for Duke Performances, as Jenny Scheinman premieres her work “Kannapolis: A Moving Portrait,” based around film from the collection.

A still from "Smithfield (N.C.) and Selma (N.C.), 1937 (Reel 1)," one of the films in the H. Lee Waters Digital Collection.
A still from “Smithfield (N.C.) and Selma (N.C.), 1937 (Reel 1),” one of the films in the H. Lee Waters Digital Collection.

Of course, we also hope the site might draw other Movies of Local People out of hiding, because while Duke and the State Archives hold a good number of the films, we still don’t know the whereabouts of some of them.  So when you visit the site, take advantage of the embed and share functions accompanying each of the videos, use them on your blog or Facebook page, guide people to H. Lee Waters at Duke, and who knows? It may lead them to investigate further, to liberate that can of film that’s been sitting in the closet or biding its time at the local library.

Post Contributed by Craig Breaden, Audiovisual Archivist, David M. Rubenstein Rare Book & Manuscript Library.

Winter Cross-Training in the DPC

The Digital Production Center engages with various departments within the Libraries and across campus to preserve endangered media and create unique digital collections. We work especially closely with The Rubenstein Rare Book, Manuscript, & Special Collections Library, as they hold many of the materials that we digitize and archive on a daily basis. This collaboration requires a shared understanding of numerous media types and their special characteristics; awareness of potential conservation and preservation issues; and a working knowledge of digitization processes, logistics, and limitations.

In order to facilitate this ongoing collaboration, we recently did a semester-long cross-training course with The Rubenstein’s Reproductions Manager, Megan O’Connell. Megan is one of our main points of contact for weekly patron requests, and we felt that this training would strengthen our ability to navigate tricky and time-sensitive digitization jobs heading into the future. The plan was for Megan to work with all three of our digitization specialists (audio, video, & still image) to get a combination of hands-on and observational learning opportunities.

IMG_1068

Still image comprises the bulk of our workload, so we decided to spend most of the training on these materials. “Still image” includes anything that we digitize via photographic or scanning technology, e.g. manuscripts, maps, bound periodicals, posters, photographs, slides, etc. We identified a group of uniquely challenging materials of this type and digitized one of each for hands-on training, including:

  • Bound manuscript – Most of these items cannot be opened more than 90 degrees. We stabilize them in a custom-built book cradle, capture the recto sides of the pages, then flip the book and capture the verso sides. The resulting files then have to be interleaved into the correct sequence.
  • Map, or other oversize item – These types of materials are often too large to capture in one single camera shot. Our setup allows us to take multiple shots (with the help of the camera being mounted on a sliding track) which we then stitch together into a seamless whole.
  • Item with texture or different item depths, e.g. a folded map, tipped into a book – It is often challenging to properly support these items and level the map so that it is all in focus within the camera’s depth of field.
  • ANR volume – These are large, heavy volumes that typically contain older newspapers and periodicals. The paper can be very fragile and they have to be handled and supported carefully so as not to damage or tear the material.
  • Item with a tight binding w/ text that goes into the gutter – We do our best to capture all of the text, but it will sometimes appear to curve or disappear into the gutter in the resulting digital image.

IMG_1069

Working through this list with Megan, I was struck by the diversity of materials that we collect and digitize. The training process also highlighted the variety of tricks, techniques, and hacks that we employ to get the best possible digital transfers, given the limitations of the available technology and the materials’ condition. I came out of the experience with a renewed appreciation of the complexity of the digitization work we do in the DPC, the significance of the rare materials in the collection, and the excellent service that we are able to provide to researchers through the Rubenstein Library.

IMG_1065

Check out Megan’s blog post on the Devil’s Tale for more on the other media formats I wasn’t able to cover in the scope of this post.

Here’s to more collaboration across boundaries in the New Year!

IMG_1173

Holiday Notes from the Homefront and Abroad

The following is a series of loosely linked stories, loosely based on our digital collections, and loosely related to the holidays, where even the word “loosely” is applied with some looseness.


 

War ration book one. Ration order no. 3.

An American looking forward to baking delicious treats for the holidays in 1942 would have been intimately familiar with War Ration Book One. The Office of Price Administration issued Ration Order No. 3 in April of that year, and distributed the ration books via elementary schools in the first week of May. Holders could purchase one pound of sugar every two weeks between May 5 and June 27. By the end of the year, butter, coffee, and other foods joined the list of regulated goods.

As the holidays approached, the newspapers ran articles advising homemakers how to cope with the unavailability of key ingredients. Vegetable shortening could help stretch butter, molasses made cookies prone to burning, and fruit juice was a natural sweetener. The New Orleans Times-Picayune’s “Round Table Talk About Food” exhorted homemakers to make the best of it:

There is something stimulating this approaching holiday time in planning Christmas meals and gift packages or baskets with those substitute items we are permitted to use, rather than with the usual abundance of foods to suit every whim of the appetite.


"YMCA Christmas, Children" (1917-1919)
“YMCA Christmas, Children” (1917-1919)

I wrote before about how YMCA missionaries took basketball overseas after its invention, including to Japan. Did they also take Santa beards to China?


hfccp010130010
“Certificate of registration to buy, sell, and transfer men’s rubber boots and rubber work shoes.” Ration order no. 6, September 29, 1942.

The Office of Price Administration provided Duke Law alum Richard Nixon with his first job in Washington, beginning in January of 1942. Rubber was his area of focus. He was industrious and diligent in his work, and by March, had been promoted to “acting chief of interpretations in the Rubber Branch.

But the life of a government regulator was not to be for Dick Nixon. He joined the Navy in August, and by year’s end found himself serving at an airfield in Ottumwa, Iowa.

– Summarized from Richard M. Nixon: A Life in Full, by Conrad Black.


True story. Terry Sanford spent December 21 and 22 of 1944 riding in a convoy that took the 1st Battalion of the 517th Parachute Regimental Combat Team from Soissons, France to the town of Soy in Belgium. His unit fought Germans for the next few days, losing more than a hundred men, in the conflagration that became known as the Battle of the Bulge.

They were able to sleep on Christmas Eve. On Christmas morning, there was roasted turkey, but at noon orders came to take a hill, which they did. The next day, they held it, repelling a German counterattack.

In the action, Sanford tackled a German officer, disarmed him, and drove him off for interrogation. Years later, he speculated that the man was probably shot before being processed as a POW, as retaliation against a recent massacre of American troops.

Sanford would write home that “things are going well in this country,” and they had “[m]ore food than elsewhere,” without explaining why there was more to go around.

While riding with his commander, Lt. Colonel “Wild Bill” Boyle, shortly after the New Year, he was wounded. He later received the Purple Heart.

– Summarized from Terry Sanford: Politics, Progress, and Outrageous Ambitions, Howard E. Covington and Marion A. Ellis.


wtddy020020490
December 24, 1940 entry from Mary McMillan’s journal.

Mary McMillan’s 1940 journal tells of her experiences as a missionary in Japan. Her entry for Christmas Eve of 1940 reads:

To doctor’s for sinus treatment, then down on Ginza shopping for Christmas presents: perfume for Umeko, a dog purse for Eiko, cookies for Mrs. Natsuzoe, toys for Mineko san and Masao chan. Lunch of fried oysters and fresh strawberries in Olympic Grille.

Brought Eiko home for her first Christmas. Tried to tell her the Christmas Story, but my limited knowledge of Japanese and her excitement made direct teaching impossible. Was up wrapping presents till almost mid-night.

A month later she wrote of talks with “Bishop Abe, and Ambassador Grew:”

They advised us to begin making preparations to leave Japan – as is we were certain that war were coming and we were sure to be called home by our Board.”

Her next entry, dated “Feb. 29, ‘41,” is headed “Homeward Bound.”


Hanukah Variant spellings for hanukkah occur three times in the OCR text of our 1960s Duke Chronicle collection. (Due to the imprecision of OCR, the actual occurrences may be more).

A photograph on the front page of the December 17, 1968, issue depicts a Star of David hanging on the side of a building. The caption reads, “It is now Hanukah, ‘the festival of lights.’”

At the top of that page, the lead story of the issue is headlined, “X-Mas amnesty asked for draft dodgers.” It reports that the “cabinet of the YMCA” at Duke had resolved to write to President Johnson on the matter. Of course, Johnson was a lame duck by then. That same day, the New York Times reported that he had spent an hour conferring with John Mitchell, Nixon’s incoming Attorney General.


Wind Song Haldeman J. Walter Thompson Company News, November 30, 1960, the two lead articles:

WIND SONG USES HANDSOME “SANTA” TO BOOST  PRE-CHRISTMAS SALES

BOB HALDEMAN NAMED MANAGER OF THE LOS ANGELES OFFICE

Eight years later, the newsletter noted the appointments of Haldeman and his subordinate, Ron Ziegler, to the White House staff. Haldeman would serve as Nixon’s Chief of Staff, and later did 18 months in prison for his role in the Watergate coverup. Ziegler became White House Press Secretary.


Sit in
Sit in, Allen Building. December, 1967.

On November 17, 1967 – the Friday before Thanksgiving – the Chronicle ran a story about Terry Sanford and his newly published book, Storm over the States. He started writing Storm soon after leaving the NC governor’s office in 1965. Supported by the Ford Foundation and the Carnegie Corporation, he holed up in an office at Duke, hired a staff, and wrote about a model for state government that is federalist but proactive and constructive. Sanford’s point of view stood in stark, if unmentioned, contrast to the doctrines of “nullification” and states’ rights that segregationists like George Wallace wielded in their opposition to Civil Rights.

That Friday ended a tumultuous week at Duke. The lead story that same day was headlined “Knight bans use of segregated facilities by student groups.” The school’s president, Douglas Knight, had “re-interpreted” a university policy statement prohibiting the use of off-campus facilities that discriminated on the basis of race. Knight extended the policy, purported to apply to staff and faculty organizations, to include student organizations as well.

The previous week, the student body had defeated a referendum that would have had the same effect. Black students reacted by staging a sit in at Knight’s office (one holding a sign that read “Students Await An Overdue Talk With Our WHITE KNIGHT”), demanding that he take action. Knight acceded, complaining in his statement that “the application of this practice would have been made in the normal course of events,” but “we were confronted with an ultimatum, which carried with it a threat of disruption of the ordinary processes of the University.”

Confrontations between the administration and black students continued to escalate, leading to the Allen Building takeover in 1969, Knight’s resignation, and his succession by Terry Sanford.


First Days Back in Japan Mary McMillan’s journals stretch from 1939 to 1991. Her “1939” journal actually contains entries from the 1940s, though there are significant gaps. In October of 1942 she wrote of traveling to Delta, Utah. She didn’t mention her purpose, and no other entries appear from that period, but she was heading to Utah to teach in the Topaz Relocation Center, an internment camp for Japanese Americans. En route, she wrote:

Those nice Marine recruits who got on our train in St. Louis shouldn’t be required to go so far from home to fight for objectives that seem to me not to be in keeping with United Nations Aims, as given in the Atlantic Charter.  Why should Japan “be crushed”? The military mind there – and elsewhere – must be forced from power; but are we on the right track towards achieving that objective? I fear most of us have become too material-minded. By following methods resulting from our materialistic thinking, we only create atmospheres for other hostile “spiritual” forces – like Naziism.

Then, the week of Christmas in 1947 she traveled to Seattle, and embarked on a return to Japan. She arrived in Hiroshima in January, the first Christian missionary to return after the war. She lived there for more than thirty years before retiring.


On page 4 of the December 17, 1967 issue of the Chronicle – just below the article about Terry Sanford’s Storm Over the States – this ad ran: RoadGoesEverOn I have to believe at least a few students and faculty got the book or the record as stocking stuffers that year.

Vagrant Up

Ruby on Rails logo

Writing software for the web has come a long way in the past decade. Tools such as Ruby on Rails, Twitter Bootstrap, and jQuery provide standard methods for solving common problems and providing common features of web applications. These tools help developers use more of their time and energy to solve problems unique to the application they are building.

One of the costs of relying on libraries and frameworks to build software is that your project not only depends on the frameworks and libraries you’ve chosen to use, but these frameworks and libraries rely on still more components. Compounding this problem, software is never really finished. There are always bugs being fixed and features being changed and added. These attributes of software, that it changes and has dependencies, complicates software projects in a few different ways:

  • You must carefully manage the versions of libraries, frameworks, and other dependencies so that you ensure all the pieces work together.
  • Developers working on multiple projects may have to use different versions of the same libraries and packages for each of their projects.
  • If you’re working with a team of developers on a project all members of the team have to make sure their computers are set up with the correct versions of the software and dependencies of the project.

Thankfully, there are still more tools available to help manage these problems. For instance, Ruby Version Manager (RVM) is a popular tools used by Ruby on Rails developers. It lets the software developer install and switch between different versions of Ruby. Other tools, such as Bundler make it possible to define exactly what version of which Ruby Gems (Ruby software packages that add functionality to your own project) you need to install for a particular project. Combined, RVM and Bundler simplify the management of complex project dependencies. There are similar tools available for other programming languages, such as Composer, which is a dependency manager for PHP.

By Fco.plj (Own work) [CC-BY-SA-3.0 (http://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons

While many of us already use dependency managers in our work, one tool we haven’t been using that we’re evaluating for use on a new project is Vagrant. Vagrant is a tool for creating virtual machines, self-contained systems that run within a host operating system. Virtual machines are software implementations of a computer system. For instance, using a virtual machine I could run Windows on my Mac hardware.

Vagrant does a few things that may make it even easier for developers to manage project dependencies.

  • With Vagrant you can write a script that contains a set of instructions about what operating system and other software you want to install in a virtual machine. Creating a virtual machine with all the software you need for a given project is as then as simple as typing a single command.
  • Vagrant provides a shared directory between your host operating system and the virtual machine. You can use the operating system you use everyday as you work while the software project runs in a virtual machine. This is significant because it means each developer can continue to use the operating system and tools they prefer while the software they’re building is all running in copies of the exact same system.
  • You can add the script for creating the virtual machine to the project itself making it very easy for new developers to get the project running. They don’t have to go through the sometimes painful process of installing a project’s dependencies by hand because the Vagrant script does it for them.
  • A developer working on multiple projects can have a virtual machine set up for each of their projects so they never interfere with each other and each has the correct dependencies installed.

Here’s how to use Vagrant in the most minimal way:

  1. Download and install VirtualBox
  2. Download and install Vagrant
  3. In a terminal window type:
    vagrant init hashicorp/precise32
  4. After running the following command you will have downloaded, set up, and started a fully functional virtual machine running Ubuntu:
    vagrant up
  5. You can then connect to and start using the running virtual machine by connecting to it via SSH:
    vagrant ssh

"Vagrantup" by Fco.plj - Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons - http://commons.wikimedia.org/wiki/File:Vagrantup.jpg#mediaviewer/File:Vagrantup.jpg

In a more complex setup you’d probably add a provisioning script with instructions for downloading and installing additional software as part of the “vagrant up” process. See the Vagrant documentation for more details about provisioning options.

We’re considering using Vagrant on an upcoming project in an effort to make it easier for all the developers on the project to set up and maintain a working development environment. With Vagrant, just one developer will need to spend the time to create the script that generates the virtual machine for the project. This should save the time of other developers on the project who should only have to install VirtualBox, copy the Vagrant file and type “vagrant up.” At least, that’s the idea. Vagrant has great documentation, so if you’re interested in learning more their website is a good place to start.

Digital Transitions Roundtable

In late October of this year, the Digital Production Center (along with many others in the Library) were busy developing budgets for FY 2015. We were asked to think about the needs of the department, where the bottlenecks were and possible new growth areas. We were asked to think big. The idea was to develop a grand list and work backwards to identify what we could reasonably ask for. While the DPC is able to digitize many types of materials and formats, such as audio and video, my focus is specifically still image digitization. So that’s what I focused on.dt-bc100-book

We serve many different parts of the Library and in order to accommodate a wide variety of requests, we use many different types of capture devices in the DPC: high-speed scanners, film scanners, overhead scanners and high-end cameras. The most heavily used capture device is the Phase One camera system. This camera system uses P65 60 MP digital back with a 72mm Schneider flat field lens. This enables us to capture high quality images at archival standards. The majority of material we digitize using this camera are bound volumes (most of them rare books from the David M. Rubenstein Library), but we also use this camera to digitize patron requests, which have increased significantly over the years (everything is expected to be digital it seems), oversized items, glass plate negatives, high-end photography collections and much more. It is no surprise that this camera is a bottleneck for still image production. In researching cameras to include in the budget, I was hard pressed to find another camera system that can compete with the Phase One camera. For over 5 years we have used Digital Transitions, a New York-based provider of high-end digital solutions, for our Phase One purchases and support. We have been very happy with the service, support and equipment we have purchased from them over the years, so I contacted them to inquire about new equipment on the horizon and pricing for upgrading our current system.captureone

New equipment they turned me onto is the BC100 book scanner. This scanner uses a 100° glass platen and two reprographic cameras to capture two facing pages at the same time. While there are other camera systems that use a similar two camera setup (most notably the Scribe, Kirtas and Atiz), the cameras and digital backs used with the BC100, as well as the CaptureOne software that drives the cameras, are more well suited for cultural heritage reproduction. Along with the new BC100, CaptureOne is now offering a new software package specifically geared toward the cultural heritage community for use with this new camera system. While inquiring about the new system, I was invited to attend a Cultural Heritage Round Table event that Digital Transitions was hosting.

This roundtable was focused on the new CaptureOne software for use with the BC100 and the specific needs of the cultural heritage community. I have always found the folks at Digital Transitions to be very professional, knowledgeable and helpful. The event they put together included Jacob Frost, Application Software R&D Manager for PhaseOne; Doug Peterson, Technical Support, Training, R&D at Digital Transitions; and Don Williams of Image Science Associates, Imaging Scientist. Don is also on the Still Image Digitization Advisory Board with the Federal Agencies Digitization Guidelines Initiative (FADGI), a collaborative effort by federal agencies to FADGI1define common guidelines, methods, and practices for digitizing historical content. They talked about the new features of the software, the science behind the software, the science behind the color technology and new information about the FADGI Still Image standard that we currently follow at the Library. I was impressed by the information provided and the knowledge shared, but what impressed me the most was the fact that the main reason Digital Transitions pulled this particular group of users and developers together was to ask us what the cultural heritage community needed from the new software. WHAT!? What we need from the software? I’ve been doing this work for about 15 years now and I think that’s the first time any software developer from any digital imaging company has asked our community specifically what we need. Don’t get me wrong, there is a lot of good software out there but usually the software comes “as is.” While it is fully functional, there are usually some work-arounds to get the software to do what I need it to do. We, as a community, spent about an hour drumming up ideas for software improvements and features.

While we still need to see follow-through on what we talked about, I am hopeful that some of the features we talked about will show up in the software. The software still needs some work to be truly beneficial (especially in post-production), but Phase One and Digital Transitions are definitely on to something.

Having it “All” – About Library Search Results

This fall we changed the default tool that students and faculty use to research library holdings. We have tools that work well for a broad search and tools that are tailored for more specialized research. So, how is this change working out?

Word cloud depicting the 30 most frequently used search terms. The size of the text is proportional to the number of times the term has been used.
Word cloud depicting the 30 most frequently used search terms. The size of the text is proportional to the number of times the term has been used.

We’ve got numbers and we’ve got opinion. First, let’s look at the numbers.

  • The most used feature on the Duke Libraries website is the search box on the homepage with 211,655 searches performed using the default “All” tab between August 25 and November 16, 2014.
  • Within the “All” tab search results, patrons selected results from Articles 48% of the time, results from Books & Media 44% of the time and other results 8% of the time. These results were presented side-by-side on a single results page.
  • The All search isn’t the only option on our homepage as the Books & Media tab was used 68,566 times and the Articles tab was used 46,028 times during the same timeframe.
  • The five most used search terms were PubMed, Web of Science, JSTOR, RefWorks, and Dictionary of National Biography.
  • The most frequently searched for fictional character was Tom Sawyer.
  • The most searched for person was Dr. Martin Luther King, Jr.

So what thoughts have you shared with us about the search options we provide?

On the Libraries' homepage, you can click the gear icon to choose a different search tab as your customized default.
On the Libraries’ homepage, you can click the gear icon to choose a different search tab as your customized default.
  • During the first four weeks of the semester, 48 people submitted their opinions through a survey linked from the search results page.
  • Thirty percent of survey respondents said that they liked having the Articles and Books & Media results appear side by side on the new search results page.
  • Twenty-seven percent said they thought the page looked cluttered or that it was hard to read.
  • Forty percent said that they did not know you can change the default search tab that appears when you view the Duke Libraries’ homepage.
  • Twenty-five percent said that they did not know that they can choose a more highly-focused search option from the Search & Find menu.
  • Testing with a small group of researchers revealed that it was difficult to locate material from the Rubenstein Library using our default search results screen.

Based on your feedback, we made the following improvements to the search results page during the semester.

  • We de-cluttered the information shown in the Articles and Books & Media columns to make results easier to read.
  • We moved “Our Website” results to the top of the right column.
  • We reduced the space used by the search box on the results page.

In the coming months, we will explore ways of making it easier to find materials from the Rubenstein Library and from University Archives. We are also investigating options for implementing a Best Bets feature on the results page; this would provide clearer access to some of the most used resources.

What can you do to help?

Complete our online survey and tell us what you think about the search tools provided through the Libraries’ homepage.

Assembling the Game of Stones

Back in October, Molly detailed DigEx’s work on creating an exhibit for the Link Media Wall. We’ve finally finalized our content and hope to have the new exhibit published to the large display in the next week or two. I’d like to detail how this thing is actually put together.

HTML Code

In our planning meetings the super group talked about a few different approaches for how to start. We considered using a CMS like WordPress or Drupal, Four Winds (our institutional digital signage software), or potentially rolling our own system. In the end though, I decided to build using super basic HTML / CSS / Javascript. After the group was happy with the design, I built a simple page page framework to match our desired output of 3840 x 1080 pixels. And when I mean simple, I mean simple.

got_assembly

I broke the content chunks into five main sections: the masthead (which holds the branding), the navigation (which highlights the current section and construction period), the map (which shows the location of the buildings), the thumbnail (which shows the completed building and adds some descriptive text), and the images (which houses a set of cross-fading historic photos illustrating the progression of construction). Working with a fixed-pixel layout feels strange in the modern world of web development, but it’s quick and satisfying to crank out. I’m using the jQuery Cycle plugin to transition the images, which is lightweight and offers lots of configurable options. I also created a transparent PNG file containing a gradient that fades to the background color which overlays the rotating images.

Another part of the puzzle I wrestled with was how to transition from one section of the exhibit to another. I thought about housing all of the content on a single page and using some JS to move from one to the next, but I was a little worried about performance so I again opted for the super simple solution. Each page has a meta refresh in the header set to the number of seconds that it takes to cycle through the corresponding set of images and with a destination of the next section of the exhibit. It’s a little clunky in execution and I would probably try something more elegant next time, but it’s solid and it works.

Here’s a preview of the exhibit cycling through all of the content. It’s been time compressed – the actual exhibit will take about ten minutes to play through.

In a lot of ways this exhibit is an experiment in both process and form, and I’m looking forward to seeing how our vision translates to the Media Wall space. Using such simple code means that if there are any problems, we can quickly make changes. I’m also looking forward to working on future exhibits and helping to highlight the amazing items in our collections.

New Angles & Avenues for Bitstreams

This week, we added a display of our most recent Bitstreams blog posts to our Digital Collections homepage (example), and likewise, a view of posts relevant to a given collection on the respective collection’s homepage (example).

Screen Shot 2014-11-12 at 1.19.56 PM

Background

Our Digital Projects & Production team has been writing in Bitstreams at least weekly since February 2014. We’ve had some excellent guest contributors, too. Some posts share updates about new digital collections or additions, while others share insights, lessons learned, and behind-the-scenes looks at the projects we’re currently tackling.

Many of our posts have been featured on our library homepage and library news site. But until now, we haven’t been able to display any of them—not even the ones about new digital collections—alongside the collections themselves. So, if you visited the DukEngineer collection in the past, you likely missed out on Melanie’s excellent overview, which puts the magazine in context and highlights the best of what’s inside.

Past Solutions

Syndicating tagged blog posts for display elsewhere is a pretty common use case, and we’ve used a bunch of different solutions as our platforms have evolved. Each solution has naturally been painstakingly tailored to accommodate the inner workings of both the source and the destination. Seven years ago, we were writing custom XSLT to create and then consume our own RSS feeds in Cascade Server CMS. We have since hopped over to Wordpress for managing news and blogs (whew!). An older version of our digital collections app used WordPress’ XML-RPC API to get tagged posts and parsed them with Python.

These days, our library website does blog syndication by using a combo of WordPress RSS, Drupal’s feed aggregator module, and occasionally Yahoo! Pipes for data mashing and munging. It works well in Drupal, but other platforms require other approaches.

Under the Hood: Angular.js and Wordpress JSON API

Bret Davidson’s Code4Lib 2014 presentation, Towards Pasta Code Nirvana: Using JavaScript MVC to Fill Your Programming Ravioli  (slides) made me hungry. Hungry for pasta, yes, but also for knowledge. I wanted to:

  1. Experiment with one of the Javascript MVC frameworks to learn how they work, and in the process…
  2. Build something potentially useful for digital collections that could be ported over to a new application framework in the future (e.g., from our current Django app to a future Ruby on Rails app).

From the many possibilities, I chose AngularJS. It seemed well-documented, increasingly popular, and with Google’s backing, it seems like it’ll be around for awhile.

WordPress JSON API

Among Angular’s virtues is that it really simplifies the process of getting and using JSON data from an API. I found Wordpress’ JSON API plugin, which was interestingly developed by staff at MoMA so they could use WordPress as a back-end to a site with a Rails front-end. So we first had to enable that for our Bitstreams blog.

AngularJS

angularjsAngularJS definitely helps keep code clean, especially by abstracting the model (the blogposts & associated characteristics, as well as the page state) from the view (indicates how to display the data) from the controller (gets and refines the data into the model, updates the model upon interactions with the view). I’ve done several projects in the past using jQuery and DOM manipulation to retrieve and display data. It usually works, but in the process I create a veritable rat’s nest of spaghetti code wherein /* no amount of commenting */ can truly help disentangle what’s happening.

Angular also supercharges HTML with more useful attributes to control a display. I’ve only just scratched the surface, but it’s clear that built-in directives like ng-repeat and filters like limitTo spare me from writing a ton of Javascript, e.g., <li ng-repeat="post in blogposts | limitTo:pageSize">. After the initial learning curve, the markup is visually intuitive. And it’s nice that directives and filters are extensible so you can make your own.

Source code: controller js, HTML (view source)

Initial Lessons Learned

  • AngularJS has a steeper learning curve than I’d expected; I assumed I could do this mini-project in a few hours, but it took a couple days to really get a handle on the basic pieces I needed for this project.
  • Writing an Angular app within a Django app is tricky. Both use {{ variable }} template tags so I had to change Angular to use [[ variable ]] instead.

Looking Ahead

I consider this an encouraging proof of concept. While our own blog posts can be interesting, there are many other sources of valuable data out in the world that are relevant to our collections that would add value for our researchers if we were able to easily get and display them. AngularJS won’t be the answer to all of these needs, but it’s nice to have in the toolset.

Notes from the Duke University Libraries Digital Projects Team