trustworthiness

Here are 13 public repositories matching this topic...

chenglin1112 / AgentTrust

Real-time trustworthiness evaluation and safety interception for AI agents. Semantic analysis, safe alternative suggestions, multi-step attack chain detection, and LLM-as-Judge.

python agent security benchmark mcp ai-safety trustworthiness guardrails llm

Updated May 2, 2026
Python

DeFacto / WebCredibility

Star

Provides web credibility models (Likert scale) to assign a trustworthiness score to a given website.

trust credibility fact-checking trustworthiness web-credibility

Updated Sep 19, 2019
Python

tpertner / squeeze

Star

Squeeze your model with pressure prompts to see if its behavior leaks.

reliability evaluation calibration alignment quality-assurance metamorphic-testing ai-safety trustworthiness hallucinations prompt-engineering llm-eval llm-evals

Updated Mar 1, 2026
Python

merrafelice / Semantic-Aware-Shilling-Attacks

Star

In this paper, we introduce SAShA, a new attack strategy that leverages semantic features extracted from a knowledge graph in order to strengthen the efficacy of the attack to standard CF models. We performed an extensive experimental evaluation in order to investigate whether SAShA is more effective than baseline attacks against CF models by ta…

security semantic-web knowledge-graph recommender-system shilling-attack trustworthiness

Updated Feb 10, 2022
Python

jorge-martinez-gil / rail

Star

A Reliability-Aware Ingress Layer for Human Feedback in Stream Analytics

streaming data-engineering data-management human-in-the-loop trustworthiness data-engineering-pipeline model-decay

Updated May 11, 2026
Python

rajdeep345 / MTLTS

Star

Codes and Datasets for our WSDM 2022 Paper: "MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs"

verification summarization trustworthiness rumor-detection trustworthy-ai

Updated Feb 26, 2022
Python

merrafelice / TAaMR

Star

Proposal of a novel adversarial attack approach, called Target Adversarial Attack against Multimedia Recommender Systems (TAaMR), to investigate the modification of MR behavior when the images of a category of low recommended products (e.g., socks) are perturbed to misclassify the deep neural classifier towards the class of more recommended prod…

security recommender-system trustworthiness adversarial-attacks

Updated Feb 11, 2021
Python

Devchandrasen / tudt-f-smart-energy

Star

Trust-gated smart-energy digital twin framework with uncertainty-aware decision support

smart-grid microgrid pandapower trustworthiness conformal-prediction digital-twin energy-systems

Updated May 12, 2026
Python

liuboxuan20010613 / trust-cost-bench

Star

benchmark verification ai-safety trustworthiness ai-evaluation llm rlhf ai-trust

Updated Mar 23, 2026
Python

SESARLab / big-data-trustworthiness

Star

An Assurance Process for Big Data Trustworthiness - Marco Anisetti, Claudio A. Ardagna, Filippo Berto

big-data assurance trustworthiness

Updated Apr 26, 2022
Python

eclipse-aerios / trust-manager

Star

A module for monitoring and evaluating the trustworthiness of an Infrastructure Element of the Cloud-Edge-IoT continuum

trustworthiness trust-score aerios infrastructure-element computing-node trust-algorithm

Updated Feb 18, 2026
Python

merrafelice / Assessing-Perceptual-and-Recommendation-Mutation-of-Adversarially-Poisoned-Visual-Recommenders

Star

In this work, we provide 24 combinations of attack/defense strategies, and visual-based recommenders to 1) access performance alteration on recommendation and 2) empirically verify the effect on final users through offline visual metrics.

deep-learning recommender-system human-in-the-loop trustworthiness adversarial-attacks

Updated Feb 11, 2021
Python

eclipse-aerios / iota-messages-api

Star

REST API to insert messages into an IOTA Tangle

rest-api trustworthiness iota-tangle aerios iota-messages-api

Updated Dec 4, 2025
Python

Improve this page

Add a description, image, and links to the trustworthiness topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the trustworthiness topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trustworthiness

Here are 13 public repositories matching this topic...

chenglin1112 / AgentTrust

DeFacto / WebCredibility

tpertner / squeeze

merrafelice / Semantic-Aware-Shilling-Attacks

jorge-martinez-gil / rail

rajdeep345 / MTLTS

merrafelice / TAaMR

Devchandrasen / tudt-f-smart-energy

liuboxuan20010613 / trust-cost-bench

SESARLab / big-data-trustworthiness

eclipse-aerios / trust-manager

merrafelice / Assessing-Perceptual-and-Recommendation-Mutation-of-Adversarially-Poisoned-Visual-Recommenders

eclipse-aerios / iota-messages-api

Improve this page

Add this topic to your repo