#Anonymization

2026-01-31

How can you create test data that closely resembles real data without leaking PII (personally identifiable information) or losing data integrity? Which tools or scripts work well for data anonymization? #dataSecurity #privacy #dataTest #anonymization #CôngNghệ #ThửNghiệm

reddit.com/r/SaaS/comments/1qs
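
One common approach to the question above, sketched under assumptions: replace each identifier with a keyed hash, so the same input always maps to the same token and joins between tables keep working. The key, the table layouts, and the `pseudonymize` helper are hypothetical and not taken from the linked thread. Keyed hashing is pseudonymization rather than full anonymization, because anyone holding the key can recompute the mapping.

```python
import hashlib
import hmac

# Hypothetical secret; in practice it would live outside the test environment.
SECRET_KEY = b"replace-with-a-real-secret"

def pseudonymize(value, key=SECRET_KEY, length=12):
    """Map a string to a stable token using HMAC-SHA256."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:length]

def mask_email(email):
    """Replace an email address with a stable, non-routable stand-in."""
    return pseudonymize(email) + "@example.invalid"

# Hypothetical tables that reference each other by email address.
users = [
    {"id": 1, "email": "alice@example.com"},
    {"id": 2, "email": "bob@example.com"},
]
orders = [
    {"order_id": 10, "customer_email": "alice@example.com"},
    {"order_id": 11, "customer_email": "bob@example.com"},
]

anon_users = [{**u, "email": mask_email(u["email"])} for u in users]
anon_orders = [{**o, "customer_email": mask_email(o["customer_email"])} for o in orders]
```

Because the mapping is deterministic, every order in `anon_orders` still points at a matching row in `anon_users`.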

2025-12-23

A compact AI model (500MB) that anonymizes personal information (PII) in text right on the device! 🛡️ No need to send data to a third-party service, so privacy is preserved. Easy to fine-tune for other languages (a Spanish example is included) or for specialized domains. Great for sharing sensitive data safely.

#AI #Anonymization #Privacy #LocalProcessing #MãHoá #BảoMật #RiêngTư #DữLiệu

reddit.com/r/LocalLLaMA/commen

Miguel Afonso Caetano remixtures@tldr.nettime.org
2025-12-10

"The Manhattan court order requiring OpenAI to hand over 20 million anonymized ChatGPT conversations to the New York Times and other publishers as part of a copyright lawsuit is being viewed as a turning point in data privacy, and not in a good way.

We obtained a copy of the order for you here.

What appears as a routine evidence request has opened the door to something far more troubling: the normalization of mass data disclosure from private digital interactions, justified in the name of legal discovery.

Even though the court insists that users’ identifying information will be stripped away, the scope of this order is staggering.

Twenty million chat logs represent millions of individual exchanges that some users believed were confidential. These records contain not just questions or writing samples, but fragments of personal thought, sensitive health concerns, professional secrets, intimate reflections, and sometimes details that no one ever intended to share beyond a chatbot interface.

The problem lies in the illusion of anonymization."

reclaimthenet.org/court-forces

#AI #GenerativeAI #USA #OpenAI #ChatGPT #Privacy #Anonymization

2025-12-03

Building a PDF anonymization tool - does anyone actually need this?

A user has built the website anonymizepdf.eu, which removes personal information from PDF files in a single click. The tool came about after the author heard that colleagues were spending hours doing this manually with software such as Adobe.

#PDF #Anonymization #Privacy #CongCu #DoiTuongSo

reddit.com/r/selfhosted/commen

2025-07-17
A perturbation: The blue line represents the original data, whereas the red dots represent the modified data created by adding noise.
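
A minimal sketch of the perturbation described in the caption, assuming zero-mean Gaussian noise; the noise distribution, the `scale` parameter, and the sample values are illustrative assumptions, not details from the post.

```python
import random

def perturb(values, scale=0.5, seed=None):
    """Return a copy of values with zero-mean Gaussian noise added to each entry."""
    rng = random.Random(seed)
    return [v + rng.gauss(0.0, scale) for v in values]

# "Blue line": the original data. "Red dots": the perturbed copy.
original = [1.0, 2.0, 3.0, 4.0, 5.0]
noisy = perturb(original, scale=0.5, seed=42)
```

A larger `scale` hides individual values better but distorts the data more, which is the usual privacy versus utility trade-off with this kind of perturbation.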
2025-07-08

For my first post, I would like to highlight not my own research but that of my colleagues, because I really love how this research began: with a question many people have asked.

Our working group researches #anonymization and #deidentification of medical and health data, where the risk of identifying people or their membership in certain groups (e.g. having a specific diagnosis) is always present. At one of our retreats, they decided to investigate the question:

Who are the adversaries trying to re-identify people in health data sets, what are their motives, and what harm could result?

The results can be found in the #OpenAccess paper: Health Data Re-Identification: Assessing Adversaries and Potential Harms

doi.org/10.3233/SHTI240626

#data #adversary #healthdata #medicalinformatics #privacy #dataprotection

2025-03-17

Last week was another stakeholder meeting on #DNS4EU. #Whalebone provided a short overview of the project including a timeline. Public launch is scheduled for June this year. The talk elaborates on various considerations of the new #DNS project. I was mostly interested in the deployment aspect, the #DDoS slides and the #privacy and #anonymization mechanisms.

My main personal concern with the project is the lack of diversity in resolver technology. The project uses only the #KnotDNS resolver. Not a bad choice, but university taught me that diversity in backend software adds resiliency. Yet, as Whalebone is a #Czech company, it is apparent why they chose #KnotDNS exclusively.

The slides are public.

2025-03-03

2- We have also shipped this standalone version in a #Docker image 🐳 based on the #frankenphp 🧟 one. #devops, you now have no more excuses not to add an #anonymization workflow to all your #cicd!

3- Last but not least, this release comes with an experimental package for #laravel! (thanks to @lonnytunes)
As said, this is experimental: we need feedback to improve this package and make it an official flavor of DbToolsBundle 🤗

2/3

2025-02-03
One issue I consider to remain with low-latency mixnets and overlay networks is that downtime can be deanonymizing.

Even if one has constant bitrate with randomly-selected short downtime/network degradation simulation, that doesn't really help when one's town loses power entirely a few times in a year or whatever and someone bothers to try and map the downtimes onto known locations of power outages over the same year.

Is there any sensible model for handling this failure case?

#Mixnet #TimingAnalysis #SideChannels #Anonymization #Deanonymization #WhyNoDirectTagEditingInAPObjectsYet
Miguel Afonso Caetano remixtures@tldr.nettime.org
2024-12-27

"This article uses the case study of an insurance product linked to a health and wellbeing program—the Vitality scheme—as a lens to examine the limited regulation of collection and use of non-personal (de-identified/anonymised) information and the impacts it has on individuals, as well as society at large. Vitality is an incentive-based engagement program that mobilises online assessment tools, preventive health screening, and physical activity and wellness tracking through smart fitness technologies and apps. Vitality then uses the data generated through these activities, mainly in an aggregated, non-personal form, to make projections about changes in behaviour and future health outcomes, aiming at reducing risk in the context of health, life, and other insurance products. Non-personal data has been traditionally excluded from the scope of legal protections, and in particular privacy and data regimes, as it is thought not to contain information about specific, identifiable people, and thus its potential to affect individuals in any meaningful way has been understood to be minimal. However, digitalisation and ensuing ubiquitous data collection are proving these traditional assumptions wrong. We show how the response of the legal systems is limited in relation to non-personal information collection and use, and we argue that irrespective of the (possibly) beneficial nature of insurance innovation, the current lack of comprehensive regulation of non-personal data use potentially leads to individual, collective and societal data harms, as the example of the Vitality scheme illustrates."

sciencedirect.com/science/arti

#Australia #HealthInsurance #Anonymization #Privacy #DataProtection #GDPR #Insurance

André Ourednik andre_ourednik
2024-03-26

Emotion-Aware Face De-identification with Generative Adversarial Networks

A very interesting take on photograph

researchgate.net/publication/3

2024-02-06

The idea behind this bundle is to make life easier for @symfony developers when executing routine database-related operations, including backup and restoration.

But the killer feature is #anonymization.

2023-10-23
2023-10-20

@ad_on_is The truth is that the #internet should be considered useful as a WAN routing layer, but nothing should actually be using it directly as it is unreliable both because of malicious actors and because of gross neglect by many operators.

Support for #AsynchronousCommunication (which #Usenet demonstrated) is an essential property. #Anonymization, #P2P routing & key-addressing are also minimum requirements to hinder trivial censorship. #Mixnet operation is even better.

∂𑁨í 🕊 d2i@mk.phreedom.club
2023-10-05

anti keystroke deanonymization tool.

#kloak : Keystroke-level online #anonymization #kernel.

A privacy tool that makes keystroke biometrics less effective. This is accomplished by obfuscating the time intervals between key press and release events, which are typically used for identification.

https://github.com/vmonaco/kloak
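
The timing obfuscation described above can be illustrated with a simplified simulation. This is not kloak's actual code: the function name, the uniform delay, and the 100 ms bound are assumptions made for the sketch, and the input timestamps are assumed to be sorted.

```python
import random

def obfuscate_timing(event_times_ms, max_delay_ms=100.0, seed=None):
    """Delay each key event by a random amount while preserving event order."""
    rng = random.Random(seed)
    released = []
    last_release = float("-inf")
    for t in event_times_ms:
        release = t + rng.uniform(0.0, max_delay_ms)
        # Never release an event before the one that preceded it.
        release = max(release, last_release)
        released.append(release)
        last_release = release
    return released

# Hypothetical press/release timestamps in milliseconds.
events = [0.0, 80.0, 95.0, 210.0, 260.0]
delayed = obfuscate_timing(events, max_delay_ms=100.0, seed=7)
```

The intervals between entries of `delayed` no longer match the typist's original rhythm, at the cost of up to `max_delay_ms` of added input latency.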

2023-09-05

New study: Australian #cancer patients support the sharing of anonymized research #data on themselves. Their #consent jumps from 50% to 80% after they see a "visual representation" of how the anonymized data will be shared.
medrxiv.org/content/10.1101/20

#Anonymization #Australia #Medicine #OpenData #Privacy #Visualizations

2023-08-25

@atoponce @angeld23 One can set up a VM with a VPN to use as a local seedbox.

It can be significantly cheaper than renting a seedbox.

But more generally #filesharing should be done on #anonymization networks like #I2P which supports a safer variant of the #torrent protocol.

2023-08-25

@ellenor2000 It makes assumptions that ignore the dubious quality of USA/Canada internet infrastructure outside of large cities (or nearly everywhere for mobile internet in Canada) and it's also annoying for users of various #anonymization networks like #Tor / #I2P (sure one can do multi-megabyte per second, but sometimes one will also do 12kbps depending on the route to a host).
