@pipeworx/openalex
Connect: https://gateway.pipeworx.io/openalex/mcp · Install: one-click buttons
Tools: 4
OpenAlex is a free, open replacement for Microsoft Academic Graph (retired at the end of 2021). It indexes 240M+ scholarly works with structured data on authors, institutions, concepts, citations, and venues. The data model is open source and the API is generous: free, with no auth required (a polite User-Agent and email are recommended).
Why this matters for AI agents
Where Semantic Scholar is search-focused and Crossref is DOI-focused, OpenAlex is the most comprehensive structured graph: papers + authors + institutions + funders + concepts, all linked. For institutional analysis, citation networks, or systematic literature review, OpenAlex covers ground the others don’t.
Common flows:
- Work lookup. Find a paper by DOI, title, or OpenAlex ID; get full structured record.
- Author / institution. Find papers by author or affiliation, e.g. Yale CS department papers from 2024.
- Concept browsing. Papers tagged with “transformer architecture” or “CRISPR Cas9.”
- Citation graph. “Who cites paper X?” or “What does paper X cite?”
Citable URI: pipeworx://openalex/work/{work_id}.
Auth
Free, public. OpenAlex strongly encourages identifying yourself via the mailto= query parameter or the User-Agent header to get “polite pool” priority. Pipeworx forwards its own mailto= value and a Pipeworx User-Agent header automatically, so callers going through the gateway do not need to set either.
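For callers who query OpenAlex directly rather than through Pipeworx, the polite-pool convention described above might look like the sketch below. The helper names (`polite_url`, `polite_headers`) and the email address are illustrative placeholders, not part of any library; only the `mailto` parameter and the User-Agent convention come from the text above.

```python
from urllib.parse import urlencode

OPENALEX_BASE = "https://api.openalex.org"


def polite_url(entity: str, params: dict, email: str) -> str:
    """Build an OpenAlex URL that carries a mailto= query parameter.

    `entity` is the collection path (for example "works"); `email` is a
    placeholder for your own contact address.
    """
    query = dict(params)      # copy so the caller's dict is not mutated
    query["mailto"] = email   # identifies the caller for the polite pool
    return f"{OPENALEX_BASE}/{entity}?{urlencode(query)}"


def polite_headers(app_name: str, email: str) -> dict:
    """Alternative to mailto=: identify yourself in the User-Agent header."""
    return {"User-Agent": f"{app_name} (mailto:{email})"}


if __name__ == "__main__":
    print(polite_url("works", {"search": "crispr"}, "you@example.org"))
    print(polite_headers("my-agent", "you@example.org"))
```

Either mechanism alone should be enough; sending both is harmless.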
Entity types
OpenAlex models 5 entity types, each with stable IDs:
| Entity | ID prefix | Example |
|---|---|---|
| Work (paper) | W | W2741809807 |
| Author | A | A1234567890 |
| Institution | I | I97018004 (Stanford) |
| Venue (journal/conference) | V | V202381698 |
| Concept (subject taxonomy) | C | C41008148 (computer science) |
Works are linked to authors, institutions (where authors are affiliated), venues (where they were published), and concepts (what they’re about). Cross-entity queries are powerful.
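Because the ID prefix encodes the entity type, an agent can route an ID without a lookup. The following is a minimal sketch based only on the prefix table above and the citable URI format given earlier; the function names are hypothetical, and the prefix set is limited to the five entities this page lists.

```python
import re

# Prefix-to-entity mapping taken from the table above.
ENTITY_BY_PREFIX = {
    "W": "work",
    "A": "author",
    "I": "institution",
    "V": "venue",
    "C": "concept",
}

_ID_PATTERN = re.compile(r"^([WAIVC])(\d+)$")


def short_id(openalex_id: str) -> str:
    """Normalize 'https://openalex.org/W123' or 'w123' to 'W123'."""
    candidate = openalex_id.strip().rsplit("/", 1)[-1].upper()
    if _ID_PATTERN.match(candidate) is None:
        raise ValueError(f"not a recognized OpenAlex ID: {openalex_id!r}")
    return candidate


def entity_type(openalex_id: str) -> str:
    """Return the entity type implied by the ID prefix."""
    return ENTITY_BY_PREFIX[short_id(openalex_id)[0]]


def citable_uri(work_id: str) -> str:
    """Format a work ID as pipeworx://openalex/work/{work_id}."""
    if entity_type(work_id) != "work":
        raise ValueError(f"expected a work ID (W prefix), got {work_id!r}")
    return f"pipeworx://openalex/work/{short_id(work_id)}"


if __name__ == "__main__":
    print(entity_type("C41008148"))
    print(citable_uri("https://openalex.org/W2741809807"))
```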
Common pitfalls
- Author disambiguation. OpenAlex makes a serious effort but isn’t perfect. The same person may end up with separate Author IDs for early-career and late-career work, and authors with common names are often split across several entities. Cross-reference with ORCID where available.
- Concept hierarchy depth. OpenAlex concepts form a 6-level hierarchy (levels 0 to 5). “Computer science” at level 0 is too coarse for most queries; levels 3-4 (“transformer model”, “BERT model”) are more useful.
- Open access status. OpenAlex tracks oa_status (gold, green, hybrid, bronze, closed). Use it to surface free-to-read versions in your output.
- Citation count vs. cited-by. OpenAlex computes citation counts from its own corpus. The same paper can show different counts in Google Scholar (broader) and Web of Science (narrower).
- Lag. New papers appear within weeks. Citations to those papers take longer because citing papers must themselves be indexed.
- Tied to Semantic Scholar? OpenAlex and Semantic Scholar are separate projects with separate data. Some overlap in coverage; some divergence in metadata. Use both for comprehensive lookups.
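Two of the pitfalls above (open access status and concept depth) lend themselves to simple post-filtering. The sketch below assumes each work is a dict with an `open_access.oa_status` field and a `concepts` list whose items carry `display_name` and `level`; that shape is an assumption modeled on OpenAlex-style records, and the Pipeworx tools may return something different. The sample records are made-up placeholders, not real papers.

```python
# Assumption: every status except "closed" counts as free to read.
FREE_TO_READ = {"gold", "green", "hybrid", "bronze"}


def is_free_to_read(work: dict) -> bool:
    """True when the record's oa_status is anything other than closed."""
    status = (work.get("open_access") or {}).get("oa_status")
    return status in FREE_TO_READ


def specific_concepts(work: dict, min_level: int = 3) -> list:
    """Keep only concepts at or below the given depth (level >= min_level)."""
    return [
        concept["display_name"]
        for concept in work.get("concepts", [])
        if concept.get("level", 0) >= min_level
    ]


# Placeholder records for illustration only.
sample_works = [
    {
        "id": "W1",
        "open_access": {"oa_status": "gold"},
        "concepts": [
            {"display_name": "Computer science", "level": 0},
            {"display_name": "Transformer model", "level": 3},
        ],
    },
    {
        "id": "W2",
        "open_access": {"oa_status": "closed"},
        "concepts": [{"display_name": "Biology", "level": 0}],
    },
]

if __name__ == "__main__":
    print([w["id"] for w in sample_works if is_free_to_read(w)])
    print(specific_concepts(sample_works[0]))
```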
Tools
- search_works — Search scholarly articles by title, authors, or keywords. Returns title, authors, journal, publication year, citation count, and abstract.
- search_authors — Find researchers by name or institution affiliation. Returns author name, ORCID, institution, publication count, and total citations.
- search_institutions — Find academic institutions by name or location (e.g., country code ‘US’, ‘GB’). Returns institution name, country, type, publication count, and research areas.
- get_concept — Look up research fields or topics by name. Returns concept description, publication count, related concepts, and parent concepts in the academic hierarchy.
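As a rough illustration of invoking one of these tools, the sketch below builds a JSON-RPC 2.0 `tools/call` request of the kind MCP clients send. The tool names come from the list above; the argument key `query` is a guess, since this page does not document the input schema, so check the schema the server advertises before relying on it.

```python
import json

# Tool names as listed above.
KNOWN_TOOLS = {"search_works", "search_authors", "search_institutions", "get_concept"}


def tool_call(name: str, arguments: dict, request_id: int = 1) -> dict:
    """Build an MCP tools/call request body as a plain dict."""
    if name not in KNOWN_TOOLS:
        raise ValueError(f"unknown tool: {name!r}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


if __name__ == "__main__":
    # "query" is a hypothetical argument name, used here for illustration.
    payload = tool_call("search_works", {"query": "CRISPR Cas9"})
    print(json.dumps(payload, indent=2))
```

In practice an MCP client library would handle session setup and transport; the payload shape is shown only to make the tool/argument split concrete.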