Digitizing and Archiving, Susie Kopecky #InternetLibrarian

Allan Hancock College, Santa Maria

Have an archive from the Hancock family, whose estate is the site of the college. Also have an archive devoted to the history of the college.

Hancock was a wealthy oilman in Southern California in the early 20th century.

They have approximately 60 large, flat archival boxes, 28 wider boxes, photo boxes, newspaper clippings, correspondence, etc. Previous librarians cleaned the items and moved them to acid-free containers. None trained as archivists.

1. Cleaned and entered into an Access database.
2. Cleaned but not entered into the dB.
3. Neither cleaned nor entered into the dB.

A storage container suffered water damage.

Metadata: decide what info to enter and how to label items with unique identifiers. Former librarians volunteered.

* Year object created
* Accession no.
* Title
* Author
* Brief description
* Part of another collection?
* Subject
* Cross-listed events and individuals

Got a scanner.

Some of the first scans: telegrams and evidence of Captain Hancock’s pursuits.


* Scanner went missing
* Having limited time
* Not currently having volunteers
* MS Access is kind of clunky
* Scanning large amounts of info in a timely fashion. (Want to have something ready for 100th anniversary of college in 2020.)
* Want a cloud-based system to share collection

Hosting possibility: Airtable. Currently being used by a performing arts archive.

Software possibility: ArchivesSpace (formerly Archivist’s Toolkit)

Crowdsourcing ideas for the above. (History students as interns, California Digital Library)

Another problem: nitrate films are flammable. May not be able to keep them.

Suggestion: LC site on personal archiving. (Possibly

Suggestion: Anything you do yourself takes lots of babying. ContentDM is expensive, but very nice.

Open source: Omeka S, Islandora


Brainstorming a content management program, Jaye Lapachet @JayeLapachet #InternetLibrarian

Jaye Lapachet, J8 Consulting

Slides here: http://www.jayelapachet.com/2018/10/17/internet-librarian-2018/

San Bruno fire, PG&E gas line blew up. PG&E had to go through pallets of documents. (SF Chronicle 3/5/2011)

Companies come to her when they are about to do an IPO and need to find documents for the SEC.

Start somewhere: it can be paper or digital.

* Culture
* People
* Process
* Systems
* Audit & control

Vague, but you can make them work for your organization. It has to work for your organization.

Culture: Try to disrupt ongoing business as little as possible.

People: Listen to ideas. Try to do things upfront that are quick wins. Findability can be one. People need to know that the way they do their work and find their information are being considered. They have to know that they’re being heard.

Process: Identifying silos. Don’t segregate by format. When someone goes to look for information, it’s all in there. Where content is needed. Information governance. Review taxonomies, but allow personal terms that may only show up for an individual user. Taxonomies need “care and feeding” (updating, etc.).

Systems: Not just buying tools. Inventory systems, which could be tools or software, but could be processes. What you have and how it’s being used. Expand those, merge them when possible.

Audit & control: regulations, etc.

Get a champion. Have succession planning in place (for yourself as the content manager).

Team collaboration spaces (e.g., Microsoft Teams, Google documents).

Find people’s hidden talents. You can make a database from that.

Think outside the box. Can blockchain help with content management? It’s good with people who don’t trust each other. Walmart is testing it with food products. Maybe a QR code could keep track of who opened a document and changed it.

Question about products to share documents with specific people: Lucidea has a project called Presto.

Question about Google Drive and privacy: Google is going to know what you put up there. Make sure you use their business products. I don’t think you have any privacy with Google, but read your contract. Dropbox and Box might be better, since they are meant for business.

When you work for a company, any work you do is for the company. But if people are concerned about privacy, you can anonymize things.

You can read the contract. You can ask for changes.

Her web site is: www.jayelapachet.com.

Edited to add links.

Tomorrow’s architects, Peter Morville @morville #InternetLibrarian

Told story of John Brown, the abolitionist.

Developed the idea of information architecture.

Went to work for Library of Congress in 2009. There were 100 separate sites, which he compared to the Winchester Mystery House. He wrote a report, which offended people. Eventually they came around. He worked on loc.gov, congress.gov, and copyright.gov

The job of the information architect is to create “environments for understanding.” German word “merkwelt,” unique ways of viewing the world.

2016 made him realize how people have lost track of the truth. Working with a Fortune 500 company that didn’t seem to care about their users or the truth. Started walking dogs, because he needed to do something that had nothing to do with the internet.

Wrote a book called “Planning for Everything.” Fable of “The Ant and the Grasshopper.” People who act and plan concurrently succeed.

Perception and emotion may not be so much about the present moment, but guidance for the future.

“Getting Things Done” is a great book, but it focuses on productivity and it argues that there’s only one way to plan. Doesn’t consider meaning.

Borges. “Garden of Forking Paths.”

Many ways to plan.

Four principles: make planning more social, tangible, agile, and reflective. (Mnemonic: STAR)

Six practices: framing, imagining, narrowing, deciding, executing, reflecting. (Mnemonic: FINDER)

“It’s not fear that stops you; it’s your unwillingness to feel fear.” Life’s too short to play it safe, and repression doesn’t work.

Hope = willpower + waypower (Charles Snyder)

Narrowing paths: evaluation and filtering. Drivers and levers.

Searching often works better with breadth first, then depth.

Explaining things backwards can be a good technique. Writing the instructions for something can help you make a decision.

Pivot or persevere?

Involving different people can find different ways around obstacles. Diversity is a cure for unpredictable adversity.

Gary Kasparov: When was the last time you made a bad decision? Reflection. Nobody wants to relive bad experiences. Repression doesn’t work. Recovery requires telling the truth.

Elephants are matriarchal. The oldest female knows the way to food and water. But some of the old ways are closed off, perverse incentives.

John Brown believed the ends justified the means. He started a war to end slavery, which may not be over. Gandhi believed that in the means is the seeds of the ends.

The library is an act of “inspiration architecture.” We’ve never needed libraries more than we do right now. Let’s go toward the fear and create a better future.

Edited to add links.

Services, tools, & techniques for discovery, Gary Price @Infodocket #InternetLibrarian

Slides at http://bit.ly/GaryIL18

Specializes in finding full-text online works on the open web. But also lots of great multimedia material, such as important lectures stream live or saved on YouTube.

Curation, organization still has a huge role.

People like to know things before other people. Once you do this once or twice, you’ve made a friend for life. Send them stuff you know they are interested in.


One Tab: Send all open browser tabs to a single web page.


Wikipedia stats, Wikipedia edits in real time

Open citations: Scholia by wikidata



U.S. government documents registry (Hathi Trust)

CRS reports now officially available. (Not everything online yet.)


Webrecorder.io (saves pages you browse, does not work with PDF docs)

New beta feature of Wayback Machine: save outlinks (i.e., pages linked to from the page you specify)

Zotero-Unpaywall integration

FBO.gov (govt. contracts): good for business research

Court Listener: court filings (sometimes free PACER docs)

Gary recommends Inoreader: RSS, Twitter feeds, Google alerts. Free and pay versions

American Archive of Public Broadcasting

Academic material: search options beyond Google Scholar. (See, for example, my Indexes page)

Edited to add links.

Designing the web for the future, Peter Morville #InternetLibrarian @morville

Information architecture expert, author of the polar bear book and more. Newest: Planning for Everything.

Search is not just technology; it’s a system.

Ann Arbor District Library (aadl.org): got to work with his home library. Analytics, one-on-one interviews and testing, stakeholder interviews. Strategy, wire frames, iterating. They said the site’s not making people aware of all we offer. He asked about mobile strategy. They talked about their app, rather than a responsive web site. Goal was to have it work on a phone, not hiding things behind a “hamburger menu.”

Library has a collection for the blind. He did a user session with a blind woman. She didn’t use the web site; she talks to the librarians. The web site was so difficult to use with a screen reader.

Some of his recommendations weren’t implemented by launch date. Over time, they have added things.

Baker Library, Harvard Business School (library.hbs.edu): Talked about not how do you use the web site or the library, how do you do research? Made schematic maps about the research process and the career path and where the library can help.

Library has about 100 databases; it’s hard to figure out which ones to use. Big beautiful reading room, where you feel you have to be quiet: that’s where they do reference interviews. An MBA student was shown how to do things, but forgot and felt dumb. Need to let people do it themselves.

Library of Congress (loc.gov): Much of his work there was doing wireframes. Shift from more than 100 web sites to three: loc.gov, congress.gov, and copyright.gov

Three levels:
* Portal: home pages and second level pages
* Search: Search across data sources and still be fast
* Objects: create a modest level of templates for these types of things. Many people will arrive at the site via a Google search that leads to these pages.

National Cancer Institute: Top result for cancer, but not for specific kinds of cancer.

Cisco intranet: Difficult to use facets (e.g., 135 content types!)

Question re. Required templates. He puts the trade offs on the table, with both stakeholders and users.

Edited to add links.

Nina Simon, Santa Cruz Museum of Art and History #InternetLibrarian #ofbyforall @ninaksimon

When she started, more people knew the building had been the county jail than knew that it was a museum, and it had been a museum for 20 years.

People ask questions like, how can we get new people into the museum? How can we get girls into engineering? That leads to lazy answers, like *those* people aren’t interested. Ask, instead, what are we willing to change to bring in new people?

In seven years, budget and visitors increased by a factor of ~7.

They try to be of, by, and for the community.

Recent exhibit: invited people to display portraits.

Ofbyforall.org : Self assessment for how well your library (and other cultural organizations) is representing your community.

State Library of Queensland, Brisbane: Aboriginal people said they share knowledge through music, rather than books, around a fire. The library made a space for music and dance with a fire pit!

You can’t just put up a sign that says “welcome” and think that you’re done. You need to welcome people as partners.

Groups like punks and bikers: they got them in by partnerships.

Dia de los Muertos event hadn’t partnered with Latinx people. She changed that. Procession from museum to the cemetery two miles away.

Bilingual staff.

When working with marginalized communities, of-by-for is NOT optional (e.g., foster youth). Start with partners you know and ask them, who else should we partner with?

Of By For All Change Network: group of orgs doing these things.